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Description 

Technical Field 

5 [0001] This invention relates to an encoding method and apparatus, a recording medium and a program. More par- 
ticularly, it relates to an encoding method and apparatus, a recording medium and a program in which the management 
information of contents of data recorded on the recording medium is rendered into a file for recording. 

Background Art 

10 

[0002] Recently, a variety of types of optical discs have been proposed as a recording medium that can be removed 
from a recording apparatus. These recordable optical discs have been proposed as a large capacity medium of several 
GBs and are thought to be promising as a medium for recording AV (audio visual) signals, such as video signals. 
Among the digital AV signal sources (supply sources), to be recorded on this recordable optical disc, there are CS 
15 digital satellite broadcast and BS digital broadcast. Additionally, the ground wave television broadcast of the digital 
system has also been proposed for future use. 

[0003] The digital video signals, supplied from these sources, are routinely image-compressed under the MPEG 
(Moving Picture Experts Group) 2 system. In a recording apparatus, a recording rate proper to the apparatus is set. If 
digital video signals of the digital broadcast are recorded in the conventional image storage mediums for domestic use, 
20 digital video signals are first decoded and subsequently bandwidth-limited for recording. In the case of the digital 
recording system, including, of course, the MPEG1 Video, MPEG2 video and DV system, digital video signals are first 
decoded and subsequently re-encoded in accordance with an encoding system for the recording rate proper to the 
apparatus for subsequent recording. 

[0004] However, this recording system, in which the supplied bitstream is decoded once and subsequently band- 
25 width-limited and re-encoded prior to recording, suffers from deteriorated picture quality. If, in recording image-com- 
pressed digital signals, the transmission rate of input digital signals is less than the recording rate for the recording 
and/or reproducing apparatus, the method of directly recording the supplied bitstream without decoding or re-encoding 
suffers from deterioration in the picture quality only to the least extent. However, if the transmission rate of the input 
digital signals exceeds the recording rate of the recording and/or reproducing apparatus, it is indeed necessary to re- 
30 encode the bitstream and to record the re-encoded bitstream, so that, after decoding in the recording and/or reproducing 
apparatus, the transmission rate will be not higher than the upper limit of the disc recording rate. 
[0005] If the bitstream is transmitted in a variable rate system in which the bit rate of the input digital signal is increased 
or decreased with time, the capacity of the recording medium can be exploited less wastefully with a disc recording 
apparatus adapted for transiently storing data in a buffer and for recording the data in a burst fashion than with a tape 
35 recording system having a fixed recording rate imposed by the fixed rpm of the rotary head. 

[0006] Thus, it may be predicted that, in the near future when the digital broadcast is to become the mainstream, an 
increasing demand will be raised for a recording and/or reproducing apparatus in which broadcast signals are recorded 
as digital signals, without decoding or re-encoding, as in a DataStreamer, and in which a disc is used as a recording 
medium. 

40 [0007] If the recording medium is increased in capacity, a larger volume of data (herein the image or speech pertinent 
to a program) can be recorded on the recording medium. So, a larger number of programs are recorded on one disc, 
with the result that the operation of the user selecting a desired one of many programs recorded in the disc is complex. 
Thus, a necessity is felt for enabling a user to confirm data recorded in reproducing the disc to enable a desired program 
(data) to be selected extremely readily. 

45 

Disclosure of the Invention 

[0008] It is therefore an object of the present invention to render the management information for contents of data 
recorded on a recording medium for recording whereby the contents of data recorded on the recording medium and 
50 the replay information can be managed properly. 

[0009] In one aspect, the present invention provides an encoding apparatus for encoding picture data, including an 
encoder for encoding the picture data at a variable rate, and a controller for managing control so that the amount of 
picture coding data will be substantially proportionate to the time lapse. 

[0010] The controller may manage control so that stuffing bytes will be encoded if the amount of the picture coding 
55 data generated per unit time is less than a preset value. 

[001 1] The controller may verify whether or not the stuffing bytes will be encoded depending on the amount of data 
generated in encoding respective pictures. 

[0012] The controller may manage control of encoding the stuffing bytes so that no overflow will be produced in the 
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VBV buffer. 

[0013] The controller may manages control to perform encoding In an encoding mode in which the amount of the 
picture coding data is substantially proportionate to the lapse of time or in a routine encoding mode. 
[0014] The controller may generate the additional information indicating whether or not the encoding mode is such 
encoding mode in which the amount of the picture coding data is substantially proportionate to the lapse of time. 
[0015] In another aspect, the present invention provides a encoding method for encoding picture data, including an 
encoding step of encoding the picture data at a variable rate, and a controlling step of managing control so that the 
amount of picture coding data will be substantially proportionate to the time lapse. 

[0016] In still another aspect, the present invention provides a recording medium having recorded thereon a com- 
puter-readable program used for controlling an encoding apparatus encoding picture data; the program including an 
encoding step of encoding the picture data at a variable rate and a controlling step of managing control so that the 
amount of picture coding data will be substantially proportionate to the time lapse. 

[0017] In still another aspect, the present invention provides a program for causing a computer controlling an en- 
coding apparatus encoding picture data to execute an encoding step of encoding the picture data at a variable rate, 
and a controlling step of managing control so that the amount of picture coding data will be substantially proportionate 
to the time elapsed. 

[0018] In the encoding method and apparatus, recording medium and the program according to the present invention, 
picture data are encoded at a variable rate, and control is managed so that the amount of encoding picture data will 
be approximately proportionate to time elapsed. 

[0019] In yet another aspect, the present invention provides a recording medium having picture data recorded ther- 
eon, the recording medium having recorded thereon an AV stream file including the picture data and audio data asso- 
ciated with the picture data, and a flag indicating the recording mode of the AV stream file. 
[0020] The flag may be time_controlled_flag. 

[0021] The flag may indicate that the mode is such a mode in which the recording is made such that the file size will 
be proportionate to the time elapsed as from the time of recording. 

[0022] In the recording medium according to the present invention, there are recorded picture data, an AV stream 
file containing audio data associated with the picture data and a flag indicating the recording mode of the AV stream file. 
[0023] Other objects, features and advantages of the present invention will become more apparent from reading the 
embodiments of the present invention as shown in the drawings. 

Brief Description of the Drawings 

[0024] 

Fig.1 shows a configuration of an embodiment of a recording and/or reproducing apparatus according to the present 
invention. 

Fig.2 illustrates the data format of data recorded on a recording medium by a recording and/or reproducing appa- 
ratus 1 . 

Fig.3 illustrates Real PlayList and Virtual PlayList. 

Figs.4A, 4B and 4C illustrate the creation of the Real PlayList. 

Figs.5A, 5B and 5C illustrate deletion of the Real PlayList. 

Figs.6A and 6B illustrate assemble editing. 

Fig. 7 illustrates provision of a sub path in the Virtual PlayList. 

Fig.8 illustrates the changing of the playback sequence of the PlayList. 

Fig. 9 illustrates a mark on the PlayList and a mark on the Clip. 

Fig.1 0 illustrates a menu thumbnail. 

Fig. 11 illustrates mark added to the PlayList. 

Fig.1 2 illustrates a mark added to the Clip. 

Fig.1 3 illustrates the relation between the PlayList, Clip and the thumbnail file. 

Fig.1 4 illustrates a directory structure. 

Fig.1 5 illustrates a syntax of infr.dvr. 

Fig.1 6 shows a syntax of DVRVolume. 

Fig.1 7 shows a syntax of Resume Volume. 

Fig.1 8 shows a syntax of UlApplnfoVolume. 

Fig.1 9 shows a table of character set values. 

Fig.20 shows a syntax of TableOf PlayList. 

Fig.2 1 shows another syntax of TableOf PlayList. 

Fig.22 shows a syntax of the Makers PrivateData. 
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Fig. 23 shows a syntax of xxxx.rpls and yyyy .vpls. 

Figs.24A to 24C illustrate the PlayList. 

Fig.25 shows a syntax of PlayList. 

Fig.26 shows a table of PlayList_type. 

Fiq 27 shows a syntax of UlApplnfoPlayList. 

Figs.28A to 28C illustrate flags in the UlApplnfoPlayList syntax shown in Fig.27. 

Fig.29 illustrates a Playltem. 

Fig.30 illustrates a Playltem. 

Fig.31 illustrates a Playltem. 

Fig.32 shows a syntax of the Playltem. 

Fig.33 illustrates IN-time. 

Fig.34 illustrates OUT-time. 

Fig.35 shows a table of Connection_Condition. 

Figs.36A to 36D illustrate Connection_Condition. 

Fig.37 illustrates BridgeSequencelnfo. 

Fig.38 shows a syntax of BridgeSequencelnfo. 

Fig.39 illustrates SubPlayltem. 

Fig.40 shows a syntax of SubPlayltem. 

Fig.41 shows a table of Mark__type. 

Fig.42 shows a syntax of PlayListMark. 

Fig.43 shows a table of Mark_type. 

Fig.44 illustrates Mark_time_stamp. 

Fig.45 shows a syntax of zzzzz.clip. 

Fig.46 shows a syntax of Ciiplnfo. 

Fig.47 shows a table of Clip_stream_type. 

Fig.48 illustrates offset_SPN. 

Fig.49 illustrates offset_SPN. 

Figs.50A, 50B illustrate the STC domain. 

Fig.51 illustrates STCJnfo. 

Fig.52 shows a syntax of STCJnfo. 

Fig.53 illustrates Programlnfo. 

Fig.54 shows a syntax of Programlnfo. 

Fig.55 shows a syntax of VideoCondinglnfo. 

Fig.56 shows a table of Video Jormat. 

Fig.57 shows a table of frame_rate. 

Fig.58 shows a table of display_aspect_ratio. 

Fig.59 shows a syntax of AudioCondinglnfo. 

Fig.60 shows a table of audio_coding. 

Fig.61 shows a table of audio_component_type. 

Fig.62 shows a table of samplingjrequency. 

Fig.63 illustrates CPI. 

Fig.64 illustrates CPI. 

Fig.65 shows a syntax of CPI. 

Fig.66 shows a table of CPI_type. 

Fig.67 illustrates video EP_map. 

Fig.68 illustrates EP_map. 

Fig.69 illustrates EP_map. 

Fig.70 shows a syntax of EP_map. 

Fig.71 shows a table of EP_typevalues. 

Fig.72 shows a syntax of EP_map_for_one_stream_PID. 

Fig.73 illustrates TU_map. 

Fig.74 shows a syntax of TU_map. 

Fig.75 shows a syntax of ClipMark. 

Fig.76 shows a table of Mark_type. 

Fig.77 shows a table of Mark_type_stamp. 

Fig.78 shows a syntax of menu.thmb and mark.thmb. 

Fig.79 shows the syntax of thumbnail. 

Fig.80 shows a table of thumbnail _picture_format. 
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Figs.81Aand 81 Billustrate tn_block. mpfG2 

Fig.82 illustrates a structure of a transport stream of D V n vfTmP EG2 

Fig.83 shows a recorder model of a transport stream * 

Fig 84 shows a player model of a transport stream of DVR MPEG2. 

Fig 85 shows the syntax of a source packet. 

Fig 86 shows the syntax of TP_extra_header. 

Fig.87 shows a table of a copy permission indicator. 

Fig.88 illustrates seamless connection. 

Fig.89 illustrates seamless connection. ^ 

Fig.90 illustrates seamless connection. 

Fig.91 illustrates seamless connection. 

Fig.92 illustrates seamless connection. 

Pin illustrates audio overlap. 

pE'S us a es seamless connection employing BridgeSequence. 
Rg 95 ilusSes seam.ess connection not employing BridgeSequence. 
Fig 96 shows a DVR STD model. 
Fig 97 is a timing chart for decoding and display. 

Fig.1 00 illustrates Video Buffering Verifier. 
Fig.1 01 illustrates VBV control. 
Fiq.1 02 illustrates VBV control. 

Fiq 1 03 shows a case of controlling the variable bit rate. 
Fia 104 shows a case of control of the variable bit rate. 

Fig 1 05 fs a flowchart for illustrating detaiis of step S21 o -fig*. 

S5S is a flowchart for H-ustrating the operation of encoding video with a variable bit rate and of recording an 

Sa^h^ 

lapse the amount of data bytes of the AV stream. 

Fig.111 shows an instance of the minimizing ope ra iotk of mjnimjzing 

Fig.114 is a flowchart for illustrating a typical operation of forming EP.map. 
Fig.1 15 illustrates a medium. 

Best Mode for Carrying out the Invention 

[0 0 25] Referringtothedrawings.presen^ 

\ typical inner structure of a recording ^^SSS^t^^ * ^ 
structure of a recording unit 2, configured ^ •^^^J^,^ ana log or digital data. 

reproducing apparatus 1 is configured for being fed with and recora ng g m The video signals , 

0026] Analog video signals and analog ^^J^^J^Sii IB. The audio signals, input to the 

such as scene changes, from the input video and aw J> «g«*. information (S). such as an 

[00271 The AV encoder 1 5 encodes input video a " d « u *° s 'J n ^ * n ° nron j zat j on , to a multiplexer 1 6. 
Lncoded video stream (V), an encoded ^^^J^l^S^MPEG (Moving Picture Expert Group) 2 
[0028] The encoded video stream is a video ■^™nmD^ In accordance with the MPEG1 system, with the 
system, whi.st the encoded audio stream , • an ^ system or an audio stream encoded 

encoded audio stream being e.g.. an audio s^ en^ 

srr^ ^ a switch 17 to a mu,tip,exed stream 

3f X'Jffi Stream ° f ^ MPEG2 Pr ° 9ram Stream " S ° UrCe 
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packetizer 19 encodes the input multiplexed stream into an AV stream composed of source packets in accordance 
with an application format of a recording medium 100 on which to record the stream. The AV stream is processed in 
ECC (error correction and coding) unit 20 and a modulation unit 21 with appendage of ECC codes and with modulation, 
before being output to a write unit 22 : which then writes (records) an AV stream file based on a control signals output 
5 by the controller 23. 

[0030] The transport stream, such as digital television broadcast, input from a digital interface or a digital television 
tuner, is input to a terminal 13. There are two recording systems for recording the transport stream input to the terminal 
13, one being a transparent recording system and the other being a system in which recording is preceded by re- 
encoding aimed to lower e.g., the recording bit rate. The recording system command information is input from a terminal 

10 24 as a user interface to a controller 23. 

[0031] In the transparent recording of the input transport stream, a transport stream, input to a terminal 13, is output 
through a switch 17 to a multiplexed stream analysis unit 18 and to the source packetizer 19. The ensuing processing 
of recording an AV stream on a recording medium is the same as that of encoding and recording analog input audio 
and video signals, as described above, and hence is not explained here for simplicity. 

is [0032] If the input transport stream is re-encoded and subsequently recorded, the transport stream, input to the 
terminal 13, is fed to a demultiplexer 26, which demultiplexes the input transport stream to extract a video stream (V), 
an audio stream (A) and the system information (S). 

[0033] Of the stream (information), as extracted by the demultiplexer 26, the video stream is output to an audio 
decoder 27, whilst the audio stream and the system information are output to the multiplexer 1 6. The audio decoder 

20 27 decodes the input transport stream to output the encoded video stream (V) to the multiplexer 16. 

[0034] The audio stream and the system information, output from the demultiplexer 26 and input to the multiplexer 
16, and the video stream, output by the AV encoder 15, are multiplexed, based on the input system information, and 
output to the multiplexed stream analysis unit 18 and to the source packetizer 19 through switch 17, as a multiplexed 
stream. The ensuing processing of recording an AV stream on a recording medium is the same as that of encoding 

25 and recording analog input audio and video signals, as described above, and hence is not explained here for simplicity. 
[0035] The recording and/or reproducing apparatus 1 of the present embodiment records a file of the AV stream on 
the recording medium 100, while also recording the application database information which accounts for the file. The 
input information to the controller 23 is the feature information for the moving picture from the analysis unit 14, the 
feature information of the AV stream from the multiplexed stream analysis unit 18 and the user command information 

30 input at a terminal 24. 

[0036] The feature information of the moving picture, supplied from the analysis unit 1 4, is generated by the analysis 
unit 14 when the AV encoder 15 encodes video signals. The analysis unit 14 analyzes the contents of the input video 
and audio signals to generate the information pertinent to the pictures characteristic of the input moving picture signals 
(clip mark). This information is the information indicating a picture of characteristic clip mark points, such as program 
35 start points, scene change points, CM commercial start and end points, title or telop in input video signals, and also 
includes a thumbnail of the picture and the information pertinent to stereo/monaural switching points and muted portions 
of audio signals. 

[0037] The above picture indicating information is fed through controller 23 to the multiplexer 1 6. When multiplexing 
a encoded picture specified as clip mark by the controller 23, the multiplexer 1 6 returns the information for specifying 
40 the encoded picture on the AV stream to the controller 23. Specifically, this information is the PTS (presentation time 
stamp) of a picture or the address information on the AV stream of an encoded version of the picture. The controller 
23 stores the sort of feature pictures and the information for specifying the encoded picture on the AV stream in asso- 
ciation with each other. 

[0038] The feature information of the AV stream from the multiplexed stream analysis unit 18 is the information 
45 pertinent to the encoding information of the AV stream to be recorded, and is recorded by an analysis unit 18. For 
example, the feature information includes the time stamp and address information of the l-picture in the AV stream, 
discontinuous point information of system time clocks, encoding parameters of the AV stream and change point infor- 
mation of the encoding-parameters in the AV stream. When transparently recording the transport stream, input from 
the terminal 13, the multiplexed stream analysis unit 18 detects the picture of the aforementioned clip mark, from the 
so input transport stream, and generates the information for specifying a picture designated by the clip mark and its type. 
[0039] The user designation information from the terminal 24 is the information specifying the playback domain, 
designated by the user, character letters for explaining the contents of the playback domain, or the information such 
as bookmarks or resuming points set by the user for his or her favorite scene. 

[0040] Based on the aforementioned input information, the controller 23 creates a database of the AV stream (Clip), 
55 a database of a group (PlayList) of playback domains (Playltem) of the AV stream, management information of the 
recorded contents of the recording medium 100 (info.dvr) and the information on thumbnail pictures. Similarly to the 
AV stream, the application database information, constructed from the above information, is processed in the ECC unit 
20 and the modulation unit 21 and input to the write unit 22. which then records a database file on the recording medium 
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0042 When the AV stream file recorded on the recording medium ^ 0 .^ S ° f 0 P ^7 eproduced b y a reproducing 
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-di^^^ 

srixr » ■ ™ in9 untt 29 and an ECC 

decoder 30 to the controller 23. ~„t m ii*»r 23 outDuts a list of PlayList recorded on the 

[0043] Based on the application database desired to be reproduced, 

Loring medium 1 00 to a user interface <*£-™™ is input to the contro.ler 23. 
from the list of PlayLists. The information ' pe*nentto^ stream file necessary in reproducing the PlayList. 
The controller 23 commands the readout un 28 ead ^^^^^ %, stream fro m the recording 
in accordance with the command, the readout unrt 28. ead i out the co p a ^ ^ ^ 

^VXSS^Z^£ SSZES the processing by the ECC decoder 30 to a 

audio signal to associated terminals 32, 33, res P^J' Wlth the information instructing random access playback or 
[0045] If fed from the terminal 24, as a user interface, with the i"*™^ fro y m tne recording medium 1 00, 

Ui P>ayt>ac k . the ^^S^^r^iSH ^ml TeTeldout unit 28 to read out the AV 
based on the contents of the database (Clip) of the ^va«Bm. ^ control|er 23 corn- 

stream. If the PlayList as ^^ted b y the user js to f Xe ^ing aTme stamp closest to the specified time point, 
mands the readout unit 28 to read out ^ta from an A picture^ having a P ^ ^ prQgram 

[0046] When the user has selected a certain d P^ , ^'2S?aSSn picture from a list of thumbnail pictures, 
Itored in the ClipMark in the Clip '"^J^^^ points stored in the ClipMark, the controller 

as demonstrated on a user ^^^^^^^^^ 100 to command the readout unit 28 to read 
23 determines the AV stream readout pos Aon ^JJ™™^ unit 28 t0 rea d out data from an l-picture having 
out the AV stream. That is, the controller 23 commands the readout . unit ^ ^ 

an address closest to the address on the AV ^^^^J^^Z *• demodulating unit 29, ECC 

decoder ^ 

[0047] If the user has commanded fast forward playback the omro AV stream 
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the write un* 22 to erase an unneeded strearn porton of the AV «m ^ ^ 

[0051] The case in which the user des,res ; t °^ P ^ n ^t h C 3 ££SU playback domains in a seamless fashion, 
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24 to a controller 23. The controller 23 commands the readout unit 28 to read out data needed to reproduce the pictures 
at the IN-point and at the OUT-point. The readout unit 28 reads out data from the recording medium 100. The data so 
read out is output through the demodulating unit 29, ECC decoder 30 and the source packetizer 1 9 to the demultiplexer 
26. 

5 [0053] The controller 23 analyzes data input to the demultiplexer 26 to determine the re-encoding method for the 
video stream (change of picture_coding_type and assignment of the quantity of encoding bits for re-encoding) and the 
re-multiplexing system to send the system to the AV encoder 15 and to the multiplexer 16. 

[0054] The demultiplexer 26 then separates the input stream into the video stream (V), audio stream (A) and the 
system information (S). The video stream may be classed into data input to the audio decoder 27 and data input to 
10 the multiplexer 1 6. The former is data needed for re-encoding, and is decoded by the audio decoder 27, with the 
decoded picture being then re-encoded by the AV encoder 15 and thereby caused to become a video stream. The 
latter data is data copied from an original stream without re-encoding. The audio stream and the system information 
are directly input to the multiplexer 16. 

[0055] The multiplexer 1 6 multiplexes an input stream, based on the information input from the controller 23, to output 
15 a multiplexed stream, which is processed by the ECC unit 20 and the modulation unit 21 so as to be sent to the write 
unit 22. The write unit 22 records an AV stream on the recording medium 100 based on the control signals supplied 
from the controller 23. 

[0056] The application database information and the operation based on this information, such as playback and 
editing, are hereinafter explained. Fig.2 shows the structure of an application format having two layers, that is PlayList 

20 and Clip, for AV stream management. The Volume Information manages all Clips and PlayLists in the disc. Here, one 
AV stream and the ancillary information thereof, paired together, is deemed to be an object, and is termed Clip. The 
AV stream file is termed a Clip AV stream file, with the ancillary information being termed the Clip Information file. 
[0057] One Clip AV stream file stores data corresponding to an MPEG-2 transport stream arranged in a structure 
prescribed by the application format. By and large, a file is treated as a byte string. The contents of the Clip AV stream 

25 file are expanded on the time axis, with entry points in the Clip (l-picture) being mainly specified on the time basis. 
When a time stamp of an access point to a preset Clip is given, the Clip Information file is useful in finding the address 
information at which to start data readout in the Clip AV stream file. 

[0058] Referring to Fig .3, PlayList is now explained, which is provided for a user to select a playback domain desired 
to be viewed from the Clip and to edit the playback domain readily. One PlayList is a set of playback domains in the 
30 Clip. One playback domain in a preset Clip is termed Play Item and is represented by a pair of an IN-point and an OUT- 
point on the time axis. So, the PlayList is formed by a set of plural Playltems. 

[0059] The PlayList is classified into two types, one of which is Real PlayList and the other of which is Virtual PlayList. 
The Real PlayList co-owns stream portions of the Clip it is referencing. That is, the Real Play List takes up in the disc 
the data capacity corresponding to a stream portion of the Clip it is referencing and, when Real PlayList is erased, the 
35 data of the stream portion of the Clip it is referencing is also erased. 

[0060] The Virtual PlayList is not co-owning Clip data. Therefore, if the Virtual PlayList is changed or erased, the 
contents of the Clip are in no way changed. 

[0061] The editing of the Real Playlist is explained. Fig.4A shows creation of Real PlayList and, if the AV stream is 
recorded as a new Clip, the Real PlayList which references the entire Clip is a newly created operation. 
40 [0062] Fig.4B shows the division of the real PlayList, that is the operation of dividing the Real PlayList at a desired 
point to split the Real PlayList in two Real PlayLists. This division operation is performed when two programs are 
managed in one clip managed by a sole PlayList and when the user intends to re-register or re-record the programs 
as separate individual programs. This operation does not lead to alteration of the Clip contents, that is to division of 
the Clip itself. 

45 [0063] Fig.4C shows the combining operation of the Real PlayList which is the operation of combining two Real 
PlayLists into one new Real PlayList. This combining operation is performed such as when the user desires to re- 
register two programs as a sole program. This operation does not lead to alteration of the Clip contents, that is to 
combining the clip itself into one. 

[0064] Fig.5A shows deletion of the entire Real PlayList. If the operation of erasing the entire preset Real PlayList, 
50 the associated stream portion of the Clip referenced by the deleted Real PlayList is also deleted. 

[0065] Fig.5B shows partial deletion of the Real PlayList. If a desired portion of the Real PlayList is deleted, the 
associated Playltem is altered to reference only the needed Clip stream portion. The corresponding stream portion of 
the Clip is deleted. 

[0066] Fig.5C shows the minimizing of the Real PlayList. It is an operation of causing the Playltem associated with 
55 the Real PlayList to reference only the stream portion of the Clip needed for Virtual PlayList. The corresponding stream 
portion of the Clip not needed for the Virtual PlayList is deleted. 

[0067] If the Real PlayList is changed by the above-described operation such that the stream portion of the Clip 
re f erenceC j by the Real PlayList is deleted, there is a possibility that the Virtual PlayList employing the deleted Clip is 
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present such that problems may be produced in the Virtual PlayList due to the deleted Clip. 

r00681 In order to prevent this from occurring, such a message which runs: "If there exists the Virtual PlayList refer- 
encing the stream portion of the Clip the Real PlayList is referencing, and the Real PlayList is deleted, the Virtual 
PlavList itself is deleted - is it all right?" is displayed for the user in response to the user's operation of deletion by way 
of confirmation or alarming, afterwhich the processing for deletion isexecuted orcancelled subject to user's commands. 
Alternatively the minimizing operation for the Real PlayList is performed in place of deleting the Virtual PlayList. 
r00691 The operation for the Virtual PlayList is now explained. If an operation is performed for the Virtual PlayList, 
the contents of the Clip are not changed. Figs.6A and 6B show the assembling and editing (IN-OUT editing). It is an 
operation of creating Playltem of the playback domain the user has desired to view to create Virtual PlayList. The 
seamless connection between Playltems is supported by the application format, as later explained. 
r00701 If there exist two Real PlayLists 1 , 2 and clips 1 , 2 associated with the respective Real PlayLists, the user 
specifies a preset domain in the Real PlayList 1 (domain from IN1 to OUT1: Playltem 1) as the playback domain, and 
also specifies as the domain to be reproduced next, a preset domain in the Real PlayList 2 (domain from IN2 to OUT2: 
Playltem 2) as the playback domain, as shown in Fig.6A, a sole Virtual PlayList made up of Playltem 1 and the Playltem2 

is prepared, as shown in Fig.6B. » im 

.00711 The re-editing of the Virtual PlayList is now explained. The re-editing may be enumerated by alteration of IN- 
or OUT points in the Virtual PlayList, insertion or appendage of new Playltems to the Virtual PlayList and delet.on of 
Plavltems in the Virtual PlayList. The Virtual PlayList itself may also be deleted. 

f00721 Fig 7 shows the audio dubbing (post recording) to the Virtual PlayList. It is an operation of registering the 
audio post recording to the Virtual PlayList as a sub path. This audio post recording is supported by the app cation 
software An additional audio stream is added as a sub path to the AV stream of the main path of the Virtual PlayList 
r00731 Common to the Real PlayList and the Virtual PlayList is an operation of changing (moving) the playback 
seouence of the PlayList shown in Fig.8. This operation is an alteration of the playback sequence of the PlayList in the 
disc (volume) and is supported by TableOf PlayList as defined in the application format, as will be explained subse- 
quently with reference to e.g., Fig.20. This operation does not lead to alteration of the Clip contents. 
r00741 The mark (Mark) is now explained. The mark is provided for specifying a highlight or charactenst.c time in the 
Clio and in the PlayList, as shown in Fig.9. The mark added to the Clip is termed the ClipMark. The ClipMark is e.g., 
a proqram indexing point or a scene change point for specifying a characteristic scene ascribable to contents in the 
AV stream. The ClipMark is generated by e.g., the analysis unit 14 of Fig.1 . When the PlayList is reproduced, the mark 
of the Clip referenced by the PlayList may be referenced and used. 

r00751 The mark appended to the PlayList is termed the PlayListMark (play list mark). The PlayListMark is e.g. a 
bookmark point or a resuming point as set by the user. The setting of the mark to the Clip and to the PlayList ,s by 
addinq a time stamp indicating the mark time point to the mark list. On the other hand, mark deletion is remov.ng the 
time stamp of the mark from the mark list. Consequently, the AV stream is in no way changed by mark setting or by 

ro076? el /^another format of the ClipMark, a picture referenced by the ClipMark may be specified on the address 
basis in the AV stream. Mark setting on the Clip is by adding the address basis information indicating the picture of the 
mark point to the mark list. On the other hand, mark deletion is removing the address basis information indicating the 
mark point picture from the mark list. Consequently, the AV stream is in no way changed by mark setting or by mark 

?0077i n ' A thumbnail is now explained. The thumbnail is a still picture added to the Volume, PlayList and Clip. There 
are two sorts of the thumbnail, one of them being a thumbnail as a representative picture indicating the contents. This 
is mainly used in a main picture in order for the user to select what the or she desired to view on acting on a cursor, 
not shown Another sort of the thumbnail is a picture indicating a scene pointed by the mark. 

r00781 The Volume and the respective PlayLists need to own representative pictures. The representative pictures 
of the Volume are presupposed to be used for initially demonstrating a still picture representing the disc contents when 
the disc is set in position in the recording and/or reproducing apparatus 1 . It is noted that the disc means the recording 
medium 1 00 which is presupposed to be a of disc shape. The representative picture of the PlayList ,s presupposed to 
be used as a still picture for representing PlayList contents. ....... t «. dio.,i iot 

r00791 As the representative picture of the PlayList. it may be contemplated to use the .nitial picture of the PlayList 
as the thumbnail (representative picture). However, the leading picture at the playback time of 0 is not necessanfy an 
optimum picture representing the contents. So, the user is allowed to set an optional picture as a thumbnail of the 
PlavList Two sorts of the thumbnails, that is the thumbnail as a representative picture ind.cat.ng the Volume and the 
thumbnail as a representative picture indicating PlayList, are termed menu thumbnails. Since the menu thumbnails 
are demonstrated frequently, these thumbnails need to be read out at an elevated speed from the . disc Thus it « 
efficient to store the totality of the menu thumbnails in a sole file. It is unnecessary for the menu thumbnails to be 
pictures extracted from the moving pictures in the volume, but may be a picture captured from a personal computer or 
a digital still camera, as shown in Fig.10. 
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[0097] The present embodiment defines the self-encoding stream format (SESF). The SESF is used for encoding 
analog input signals and for decoding digital input signals for subsequently encoding the decoded signal into an MPEG- 
2 transport stream. 

[0098] The SESF defines an elementary stream pertinent to the M PEG-2 transport stream and the AV stream. When 
5 the recording and/or reproducing apparatus 1 encodes and records input signals in the SESF, an EP_map is created 
and recorded on the disc. 

[0099] A digital broadcast stream uses one of the following systems for recording on the recording medium 100: 
First, the digital broadcast stream is transcoded into an SESF stream. In this case, the recorded stream must conform 
to SESF and EP_map must be prepared and recorded on the disc. 

10 [0100] Alternatively, an elementary stream forming a digital broadcast stream is transcoded to a new elementary 
stream and re-multiplexed to a new transport stream conforming to the stream format prescribed by the organization 
for standardizing the digital broadcast stream. In this case, an EP_map must be created and recorded on the disc. 
[0101] For example, it is assumed that the input stream is an MPEG-2 transport stream conforming to the ISDB 
(standard appellation of digital BS of Japan), with the transport stream containing the HDTV video stream and the 

is MPEG AAC audio stream. The HDTV video stream is transcoded to an SDTV video stream, which SDTV video stream 
and the original AAC audio stream are re -multiplexed to TS. The SDTV stream and the transport stream both need to 
conform to the ISDB format. 

[01 02] Another system of recording the digital broadcast stream on the recording medium 1 00 is to make transparent 
recording of the input transport stream, that is to record the input transport stream unchanged, in which case the 
20 EP_map is formulated and recorded on the disc. 

[0103] Alternatively, the input transport stream is recorded transparently, that is an input transport stream is recorded 
unchanged, in which case TU_map is created and recorded on the disc. 

[0104] The. directory and the file are hereinafter explained. The recording and/or reproducing apparatus 1 is here- 
inafter described as DVR (digital video recording). Fig. 14 shows a typical directory structure on the disc. The directories 
25 of the disc of the DVR may be enumerated by a root directory including "DVR" directory, and the "DVR" directory, 
including "PLAYLIST" directory, "CLIPINF" directory, ,, M2TS" directory and "DATA" directory, as shown in Fig. 14. Al- 
though directories other than these may be created below the root directory, these are discounted in the application 
format of the present embodiment. 

[0105] Below the "DATA" directory, there are stored all files and directories prescribed by the DVR application format- 
so The "DVR" directory includes four directories. Below the "PLAYLIST' directory are placed Real PlayList and Virtual 
PlayList database files. The latter directory may exist in a state devoid of PlayList. 

[0106] Below "CLIPINF" is placed a Clip database. This directory, too, may exist in a state devoid of AV stream files. 
In the "DATA" directory, there are stored files of data broadcast, such as digital TV broadcast. 

[01 07] The "DVR" directory stores the following files: That is, an "info.dvr" is created below the DVR directory to store 
35 the comprehensive information of an application layer. Below the DVR directory, there must be a sole info.dvr. The 
filename is assumed to be fixed to info.dvr. The "menu.thmb" stores the information pertinent to the menu thumbnails. 
Below the DVR directory, there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". 
If there is no menu thumbnail, this file may not exist. 

[0108] The "mark.thmb" file stores the information pertinent to the mark thumbnail picture. Below the DVR directory, 
40 there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". If there is no menu thumbnail, 
this file may not exist. 

[0109] The "PLAYLIST" directory stores two sorts of the PlayList files which are Real PlayList and Virtual PlayList. 
An "xxxxx.rpts" file stores the information pertinent to one Real PlayList. One file is created for each Real PlayList. 
The filename is "xxxxx.rpls", where "xxxxx" denotes five numerical figures from 0 to 9. A file extender must be "rpls". 
45 [0110] The "yyyyy.vpls" stores the information pertinent to one Virtual PlayList. One file with a filename "yyyyyvpls" 
is created from one Virtual PlayList to another, where "yyyyy" denotes five numerical figures from 0 to 9. A file extender 
must be "vpls". 

[0111] The "CLIPINF" directory stores one file in association with each AV stream file. The "zzzzz.clpi" is a Clip 
Information file corresponding to one AV stream file (Clip AV stream file or Bridge-Clip stream file). The filename is 
50 "zzzzz.clpi", where "zzzzz" denotes five numerical figures from 0 to 9. A file extender must be "dpi". 

[01 1 2] The "M2TS n directory stores an AV stream file. The "zzzzz. m2ts° file is an AV stream file handled by the DVR 
system. This is a Clip AV stream file or a Bridge-Clip AV stream file. The filename is "zzzzz. m2ts", where "zzzzz" 
denotes five numerical figures from 0 to 9. A file extender must be "m2ts". 

[0113] The "DATA" directory stores data transmitted from data broadcasting. This data may, for example, be XML 
55 or MPEG files. 

[0114] The syntax and the semantics of each directory (file) are now explained. Fig. 15 shows the syntax of the "info, 
dvr" file. The "info.dvr" file is made up of three objects, that is DVRVoumeQ, TableOf PlayLists() and MakersPrivateData 

()• 
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[0115] The syntax of info. dvr shown in Fig.15 is explained. TheTableOfPlayLists_Start_address indicates the leading 
address of the TableOfPlayLists() in terms of the relative number of bytes from the leading byte of the "info. dvr file. 
The relative number of bytes is counted beginning from 0. 

[0116] The Makers PrivateData_Start_address indicates the leading address of the MakersPrivateData() t in terms 
5 of the relative number of bytes as from the leading byte of the "info.dvr" file. The relative number of bytes is counted 
from 0. The padding_word is inserted in association with the syntax of "info.dvr". N1 and N2 are optional positive 
integers. Each padding word may assume an optional value. 

[0117] The DVRVolume() stores the information stating the contents of the volume (disc). Fig. 16 shows the syntax 
of the DVRVolume. The syntax of the DVRVolume() : shown in Fig.16 : is now explained. The version_number indicates 
10 four character letters indicting the version numbers of the DVRVolume(). The version_number is encoded to "0045" in 
association with IS0646. 

[0118] Length is denoted by 32-bit unsigned integers indicating the number of bytes from directly after the length 
field to the trailing end of DVRVolume(). 

[0119] The ResumeVolume() memorizes the filename of the Real PlayList or the Virtual PlayList reproduced last in 
15 the Volume. However, the playback position when the user has interrupted playback of the Real PlayList or the Virtual 
PlayList is stored in the resume-mark defined in the PlayListMarkQ (see Figs.42 and 43). 

[0120] Fig. 17 shows the syntax of the ResumeVolumeQ. The syntax of the ResumeVolume() shown in Fig. 17 is 
explained. The valid_flag indicates that the resume_PlayList_name field is valid or invalid when this 1 -bit flag is set to 
1 or 0, respectively. 

20 [0121] The 10-byte field of resume_PlayList_name indicates the filename of the Real PlayList or the Virtual PlayList 
to be resumed. 

[0122] The UlApp Info Volume in the syntax of the DVRVolumeQ, shown in Fig. 16, stores parameters of the user 
interface application concerning the Volume. Fig.1 8 shows the syntax of the UlApplnfo Volume, the semantics of which 
are now explained. The 8-bit field of character_set indicates the encoding method for character letters encoded in the 

25 Volume_name field. The encoding method corresponds to the values shown in Fig.1 9. 

[01 23] The 8-bit field of the namejength indicates the byte length of the Volume name indicated in the Volume_name 
field. The Voiume_name field indicates the appellation of the Volume. The number of bytes of the number of the 
namejength counted from left of the field is the number of valid characters and indicates the volume appellation. The 
values next following these valid character letters may be any values. 

30 [0124] The Volume_protect_flag is a flag indicating whether or not the contents in the Volume can be shown to the 
user without limitations. If this flag is set to 1 , the contents of the Volume are allowed to be presented (reproduced) to 
the user only in case the user has succeeded in correctly inputting the PIN number (password). If this flag is set to 0, 
the contents of the Volume are allowed to be presented to the user even in case the PIN number is not input by the user. 
[0125] If, when the user has inserted a disc into a player, this flag has been set to 0, or the flag is set to 1 but the 

35 user has succeeded in correctly inputting the PIN number, the recording and/or reproducing apparatus 1 demonstrates 
a list of the PlayList in the disc. The limitations on reproduction of the respective PiayLists are irrelevant to the 
Volume_protect_flag and are indicated by playback_control_flag defined in the UlApplnfovblume. 
[0126] The PIN is made up of four numerical figures of from 0 to 9, each of which is coded in accordance with ISO/ 
IEC 646. The ref_thumbnait_index field indicates the information of a thumbnail picture added to the Volume. If the 

40 reMhumbnailJndex field is of a value other than OxFFFF, a thumbnail picture is added to the Volume. The thumbnail 
picture is stored in a menu.thumb file. The picture is referenced using the value of the ref_thumbnail_index in the menu, 
thumb file. If the ref_thumbnail_index field is OxFFFF, it indicates that a thumbnail picture has been added to the Volume. 
[0127] The TableOfPlayList() in the info.dvr syntax shown in Fig.15 is explained. The TableOfPlayList() stores the 
filename of the PlayList (Real PlayList and Virtual PlayList). All PlayList files recorded in the Volume are contained in 

45 the TableOfPlayListQ, which TableOfPlayList() indicates the playback sequence of the default of the PlayList in the 
Volume. 

[0128] Fig.20 shows the syntax of the TableOfPlayList(), which is now explained. The version_number of the Table- 
OfPlayListQ indicates four character letters indicating the version numbers of the TableOf Play Lists. The 
version_number must be encoded to "0045" in accordance with ISO 646. 
50 [0129] Length is a unsigned 32-bit integer indicating the number of bytes of the TableOf PiayListQ from directly after 
the length field to the trailing end of the TableOf PlayList(). The 16-bit field of the number_of_Play Lists indicates the 
number of loops of the for-loop inclusive of the PlayList_file_name. This numerical figure must be equal to the number 
of PiayLists recorded in the Volume. The 10-byte numerical figure of the PlayList_file_name indicates the filename of 
the Play Lists. 

55 [0130] Fig.21 shows another configuration of the syntax of the TableOfPlayList(). The syntax shown in Fig.21 is 
comprised of the syntax shown in Fig.20 in which is contained the UlApplnfoPlayList. By such structure including the 
UlApplnfoPlayList, it becomes possible to create a menu picture simply on reading out the TableOfPlayLists. The 
following explanation is premised on the use of the syntax shown in Fig.20. 
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[0131] The MakersPrivateData in the info.dvr shown in Fig. 15 is explained. The Makers Private Data is provided to 
permit the maker of the recording and/or reproducing apparatus 1 to insert private data of the maker in the MakersPri- 
vateData() for special applications of different companies. The private data of each maker has standardized maker_ID 
for identifying the maker who has defined it. The MakersPrivateData() may contain one or more makerJD. 
5 [0132] If a preset maker intends to insert private data, and the private data of a different maker is already contained 
in the MakersPrivateData(), the new private data is added to the Makers PrivateData() without erasing the pre-existing 
old private data. Thus, in the present embodiment, private data of plural makers can be contained in one MakersPri- 
vateData(). 

[0133] Fig.22 shows the syntax of the MakersPrivateData. The syntax of the MakersPrivateData shown in Fig.22 is 
10 explained. The version_number of the TableOf PlayList() indicates four character letters indicating the version numbers 
of the TableOfPlayLists. The version_n umber must be encoded to M 0045 M in accordance with ISO 646. Length is a 
unsigned 32-bit integer indicating the number of bytes of the TableOf PlayListQ from directly after the length field to the 
trailing end of the MakersPrivateData(). 

[0134] The mpd_blocks_start_address indicates the leading end address of the first mpd_block() in terms of the 
15 relative number of bytes from the leading byte of the Makers PrivateData(). The number_of_maker_entries is the 1 6-bit 
codeless integer affording the number of entries of the maker private data included in the MakersPrivateData(). There 
must not be present two or more maker private data having the same makerJD values in the Makers PrivateData(). 
[0135] The mpd_blocks_size is a 16-bit unsigned integer affording one mpd_block size in terms of 1024 bytes as a 
unit. For example, if mpd_block_size = 1, it indicates that the size of one mpd_block is 1024 bytes. The 
20 number_of_mpd__b locks is a 16-bit unsigned integer affording the number of mpd_blocks contained in the MakersPri- 
vateData(). The makeMD is the 1 6-bit unsigned integer indicating the model number code of the DVR system which 
has created the maker private data. The value encoded to the makerJD is specified by the licensor. 
[0136] The maker_model_code is a 16-bit unsigned integer indicating the model number code of the DVR system 
which has created the maker private data. The value encoded to the maker_model_code is set by the maker who has 
25 received the license of the format. The start_mpd_block_n umber is a 1 6-bit unsigned integer indicating the number of 
the mpd_block number at which begins the maker private data. The leading end of the maker private data must be 
aligned with the leading end of the mpd_block. The start_mpd_block_n umber corresponds to a variable j in the for- 
loop of the mpd_block. 

[0137] The mpdjength is a 32-bit unsigned integer indicating the size of the maker private data. The mpd_block is 
30 an area in which is stored maker private data. All of the mpd_blocks in the Makers PrivateData() must be of the same 
size. 

[0138] The real PlayList file and the Virtual PlayList file, in other words, xxxxx.rpls and yyyyy.vpls, are explained. 
Fig. 23 shows the syntax of xxxxx.rpls (Real PlayList) and yyyyy.vpls (Virtual PlayList), which are of the same syntax 
structure. Each of the xxxxx.rpls and yyyyy.vpls is made up of three objects, that is PlayList(), PlayListMark() and 
35 Makers PrivateData(). 

[0139] The PlayListMark_Start_address indicates the leading address of the PlayListMarkQ, in terms of the relative 
number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted from zero. 
[0140] The MakersPrivateData_Start_address indicates the leading address of the Makers PrivateData(), in terms 
of the relative number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted 
40 from zero. 

[0141] The padding_word (padding word) is inserted in accordance with the syntax of the PlayList file, with N1 and 
N2 being optional positive integers. Each padding word may assume an optional value. 

[0142] PlayList will be further explained in the following although it has been explained briefly. A playback domain 
in all Clips except Bridge-Clip must be referred by all PlayLists in the disc. Also, two or more Real PlayLists must not 
45 overlap the playback domains shown by their Playltems in the same Clip. 

[0143] Reference is made to Figs.24A, 24B and 24C. For all Clips, there exist corresponding Real PlayLists, as 
shown in Fig.24A. This rule is observed even after the editing operation has come to a close, as shown in Fig.24B. 
Therefore, all Clips must be viewed by referencing one of Real PlayLists. 

[0144] Referring to Fig.24C, the playback domain of the Virtual PlayList must be contained in the playback domain 
so and in the Bridge-Clip playback domain . There must not be present in the disc Bridge-Clip not referenced by any Virtual 
PlayList. 

[0145] The Real PlayList, containing the list of the Play Item, must not contain SubPlayltem. The Virtual PlayList 
contains the Playltem list and, if the CPI_type contained in the PlayList() is the EP_map type and the PlayList_type is 
0 (PlayList containing video and audio), the Virtual PlayList may contain one SubPlayltem. In the PlayList() in the 
55 present embodiment, the SubPlayltem is used only for audio post recording. The number of the SubPlayltems owned 
by one Virtual PlayList must be 0 or 1 . 

[0146] The PlayList is hereinafter explained. Fig.25 shows the PlayList syntax which is now explained. The 
version_n umber indicates four character letters indicting the version numbers of the PlayListQ. The version_number 
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is encoded to "0045" in association with ISO 646. Length is a 32-bit unsigned integer indicating the total number of 
byte of the PlayList() as from directly after the length field to the trailing end of the PlayList(). The PlayList Jype, one 
example of which is shown in Fig.26, is an 8-bit field indicating the PlayList type. 

r01471 The CPLtype is one-bit flag indicating the value of the CPI_type of the Clip referenced by the Playltem() and 
by the SubPlayltem(). The CPl.types defined in the CPIs of all Clips referenced by one PlayList must be of the same 
values The number_of_Playltems is a 16-bit field indicating the number of Play Items present in the PlayList. 
r0148l' The Playltemjd corresponding to the preset Playltem() is defined by the sequence in which the Playltem() 
appears in the for-loop containing the Playltem(). The Playltemjd begins with 0. The nimber_of_SubPlayltems is a 
16-bit field indicating the number of SubPlayltem in the PlayList. This value is 0 or 1 . An additional audio stream path 
(audio stream path) is a sort of a sub path. _ 
01491 The UlApplnfoPlayList of the PlayList syntax shown in Fig.25 is explained. The UlApplnfoPlayList stores 
parameters of the user interface application concerning the PlayList. Fig.27 shows the syntax of the U lAppInf oPIayList, 
which is now explained. The character_set is an 8-bit field indicating the method for encoding character letters encoded 
in the PlayList name field. The encoding method corresponds to the values conforming to the table shown in Fig.19. 
r01 501 The namejength is an 8-bit field indicating the byte length of the PlayList name indicated in the PlayList.name 
field The PlayList_name field shows the appellation of the PlayList. The number of bytes of the number of the 
namejength counted from left of the field is the number of valid characters and indicates the PlayList appellation. The 
values next following these valid character letters may be any values. 

r01 51 1 The record _time_and_date is a 56-bit field storing the date and time on which the PlayList was recorded This 
field is 14 numerical figures for year/ month/ day/ hour/minute/ second encoded in binary coded decimal (BCD). For 

example, 2001/12/23:01:02:03 is encoded to "0x20011223010203". 

r01 52] The duration is a 24-bit field indicating the total replay time of the PlayList in terms of hour/ minute/ second 
as a unit. This field is six numerical figures encoded in binary coded decimal (BCD). For example, 01 :45:30 is encoded 
to "0x01 4530" 

T01531 The validoeriod is a 32-bit field indicating the valid time periods of the PlayList. This field is 8 numerical 
figures encoded in 4-bit binary coded decimal (BCD). The valid_period is used in the recording and/or reproducing 
apparatus 1 e.g., when the PlayList, for which the valid period has lapsed, is to be automatically erased. For example, 
2001/05/07 is encoded to "0x20010507". 

T01541 The maker JD is a 1 6-bit unsigned integer indicating the maker of the DVR player (recording and/or repro- 
ducing apparatus 1 ) which has been the last to update its PlayList. The value encoded to maker ID is assigned to the 
licensor of the DVR format. The maker.code is a 16-bit unsigned integer indicating the model number of the DVR 
player which has been the last to update the PlayList. The value encoded to the maker.code is determined by the 
maker who has received the license of the DVR format. 

r0155l If the flag of the playback_controlJlag is set to 1 , its PlayList is reproduced only when the user successfully 
entered the PIN number. If this flag is set to 0, the user may view the PlayList without the necessity of inputting the 

roTsei^the writeorotectjlag is set to 1, the contents of the PlayList are not erased nor changed except the 
write protect flag. If this flag is set to 0, the user is free to erase or change the PlayList. If this flag is set to 1, the 
recording and/or reproducing apparatus 1 demonstrates a message requesting re-confirmation by the user before the 
user proceeds to erase, edit or overwrite the PlayList. 

r0157l The Real PlayList, in which the write_protectJlag is set to 0, may exist, the Virtual PlayList, referencing the 
Clip of the Real PlayList may exist, and the write_protectJlag of the Virtual PlayList may be set to 1 . If the user ,s 
desirous to erase the Real PlayList, the recording and/or reproducing apparatus 1 issues an alanrn to the user as to 
the presence of the aforementioned Virtual PlayList or "minimizes" the Real PlayList before eras.ng the Real PlayList. 
r01 581 If isolayed flag is set to 1 , as shown in Fig.28B, it indicates that the PlayList was reproduced at least once 
since it was recorded, whereas, if it is set to 0, it indicates that the PlayList was not reproduced even once since .t was 

r0159i ed Archive is a two-bit field indicating whether the PlayList is an original or a copy, as shown in Fig.28C. The 
field of ref thumbnail Jndex indicates the information of a thumbnail picture representative of the PlayList. If the 
ref thumbnail index field is of a value other than OxFFFF, a thumbnail picture representative of the PlayList is added 
in The PlavList with the PlayList being stored in the menu.thmb file. The picture is referenced using the value of 
ref Jhumbnailjndex in the menu.thmb file. If the ref Jhumbnailjndex field is OxFFFF, no thumbnail picture represent- 
ative of the PlayList is added in the PlayList. 

T01601 The Playltem is hereinafter explained. One Playltem() basically contains the following date. 
Clip information file name for specifying the filename of the Clip, IN-time and OUT-time. paired Itogether to specify 
the playback domain of Clip, STC_sequence_id referenced by .N-time and OUT-time in case the CPI Jype defined in 
PlayListO is EP.map type, and Connection_Condition indicating the connection condition of previous Playltem and 
current Playltem. 
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STC discontinuous points (discontinuous points of the system time base). The clocks of audio samples of the Clip used 
in the sub path are locked to the clocks of the . audio samples of the main path. 

[0177] Fig.40 shows the syntax of the SubPlayltem. Turning to the syntax of the SubPlayltem f shown in Fig.40, the 
field of the Clip Jnformation_file_name indicates the filename of the Clip Information file and is used by a sub path in 
5 the PlayList. The Clip_stream_type defined in this CliplnfoQ must indicate the Clip AV stream. 

[0178] An 8-bit field of sync_Playltem_id indicates the sub path type. Here, only '0x00' is set, as shown in Fig.41 , 
while other values are reserved for future use. 

[0179] The 8-bit field of sync_Playltem_id indicates the Playltemjd of the Playltem containing the time of playback 
start of the sub path on the time axis of the main path. The value of Playltemjd corresponding to the preset Playltem 

W is defined in the Playl_ist() (Fig.25). 

[0180] A 32-bit field of sync_start_PTS_of_ Play Item denotes the time of playback start of the sub path on the time 
axis of the main path, and denotes the upper 32 bits of the PTS (presentation time stamp) on the Playltem referenced 
by the sync_ Playltemjd. The upper 32 bit field of the SubPath_IN_time stores the playback start time of the sub path. 
SubPath JN_time denotes upper 32 bits of the PTS of 33 bits corresponding to the first presentation unit in the sub path . 

15 [0181] The upper 32 bitfield of subPath_OUT_time stores the playback end time of the sub path. SubPath JDUTJime 
indicates upper 32 bits of the value of the Presentation_end_TS calculated by the following equation: 

Presentation_end_TS = PTS_OUT + AU_duration 

20 

where PTS_out is the PTS of the 33 bit length corresponding to the last presentation unit of the SubPath and 
AU_duration is the 90 kHz based display period of the last presentation unit of the SubPath. 

[0182] Next, PlayListMark() in the syntax of xxxxx.rpls and yyyyy.vpls shown in Fig.23 is explained. The mark infor- 
mation pertinent to the PlayList is stored in this PlayListMark. Fig.42 shows the syntax of PlayListMark. Turning to the 

25 syntax of the PlayListMark shown in Fig.42, version_number is four character letters indicating the version number of 
this PlayListMark(). The version_number must be encoded to "0045" in accordance with ISO 646. 
[0183] Length is an unsigned 32 -bit integer indicating the number of bytes of PlayListMark() from directly after the 
length field to the trailing end of the PlayListMark().The number_of_PlayListMarks is a 16-bit unsigned integer indicating 
the number of marks stored in the PlayListMark. The number_of_PlayList Marks may be zero. The mark_type is an 

30 8-bit field indicating the mark type and is encoded in the table shown in Fig.43. 

[0184] A 32-bit filed of mark_time_stamp stores a time stamp indicating the point specified by the mark. The seman- 
tics of the mark_tirne_stamp differ with CPI_type defined in the PlayListQ, as shown in Fig.44. The Playltemjd is an 
8-bit field specifying the Playltem where the mark is put. The values of Playltemjd corresponding to a preset Playltem 
is defined in the PlayListQ (see Fig.25). 

35 [0185] An 8-bit field of character_set shows the encoding method of character letters encoded in the mark_name 
field. The encoding method corresponds to values shown in Fig. 19. The 8-bit field of namejength indicates the byte 
length of the mark name shown in the mark_name field. The mark_name field denotes the mark name indicated in the 
mark_name field. The number of bytes corresponding to the number of namejengths from left of this field is the 
effective character letters and denotes the mark name. In the mark_name field, the value next following these effective 

40 character letters may be arbitrary. 

[0186] The field of the ref Jhumbnailjndex denotes the information of the thumbnail picture added to the mark. If 
the field of the ref Jhumbnailjndex is not OxFFFF, a thumbnail picture is added to its mark, with the thumbnail picture 
being stored in the mark.thmb file. This picture is referenced in the mark.thmb file, using the value of 
ref JhumbnaiUndex, as explained subsequently. If the ref Jhumbnailjndex field is OxFFFF, it indicates that no thumb- 

45 nail picture is added to the mark. 

[0187] The Clip Information file is now explained. The zzzzz.clpi (Clip Information file) is made up of six objects, as 
shown in Fig.45. These are Cliplnfo(), STCJnfo() } Program(), CPI(), ClipMark() and MakersPrivateDataQ. For the AV 
stream (Clip AV stream or Bridge-Clip AV stream) and the corresponding Clip information file, the same string of nu- 
merals "zzzzz" is used. 

so [0188] Turning to the syntax of zzzzz.clpi (Clip Information file) shown in Fig.45 is explained. The 
Cliplnfo_Start_address indicates the leading end address of Cliplnfo() with the relative number of bytes from the leading 
end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 

[01 89] The STC Jnfo_Start_address indicates the leading end address of STC Jnfo with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The Programlnfo_Start_address indicates the leading end 
55 address of Programlnfo() with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. 
The relative number of bytes is counted from 0. The CPI_Start_address indicates the leading end address of CPI() 
with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes 
is counted from zero. 
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T01901 The ClipMark_Start_address indicates the leading end address of ClipMark() with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 
The MakersSteData Start_address indicates the leading end address of MakersPrivateDataO with the relafve 
number of bvtes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from 
z^o The padd!n7word is inserted in accordance with the syntax of the zzzzz.clpi file. N1 , N2, N3, N4 and N5 must 
be zero or optional positive integers. The respective padding words may also assume opt.onal values. 
r0191] The Cliplnfo is now explained. Fig.46 shows the syntax of Cliplnfo. Fig.46 shows the syntax of Cl.pl nfo. In 
the CliplnfoO is stored the attribute information of corresponding AV stream files (Clip AV stream or Bndge-Cl.p AV 

TOImT ^Turning to the syntax of the Cliplnfo shown in Fig.46, version.number is the four character letters indicating 
he version number of this Clip.nf o(). The version.number must be encoded to »0045» in accordance wrth the ISO 646 1 
Length is a 32-bit unsigned integer indicating the number of bytes of CliplnfoO from d.rectly at back of the length field 
to the trailing end of the Cliplnfo(). An 8-bit field of Clip_stream_type indicates the type of the AV stream corresponding 
to the Sip information file, as shown in Fig.47. The stream types of the respective AV streams w.ll be expla.ned sub- 

roSr'The 32-bit field of offset.SPN gives an offset value of the source packet number of the first source packet 
number of the first source packet of the AV stream (Clip AV stream orthe Bridge-Clip AV stream). When the AV stream 

file is first recorded on the disc, this offset_SPN must be zero. 

101941 Referring to Fig.48, when the beginning portion of the AV stream file is erased by ed.ting, offset_SPN may 
assume a va.ue other than 0. In the present embodiment, the re.ative source packet "^<^ **J»> ^ 
encinq the offset SPN is frequently described in the form of RSPNxxx, where xxx .s modif.ed such that RSPN_xxx .s 
RAPN EP start "The relative source packet number is sized with the source packet number as a unit and .s counted 
from the first source packet number of the AV stream file with the value of the offset_SPN as the initial value. 
[0195] The number of sourcepacketsfromthefirstsource packet oftheAV stream file tothe source packet referenced 
by the relative source packet number (SPN.xxx) is calculated by the following equat.on: 

SPN_xxx = RSPN_xxx-offset_SPN. 

30 Fiq 48 shows an instance in which offset.SPN is 4. 

[0?961 TS recording rate is a 24-bit unsigned integer, which affords an input/output b,t rate requ.red for the AV 
stream to thi DVR drive (write unit 22) or from the DVR drive (readout unit 28). The record_time_and_date .s a 56-b.t - 
field for storing the date and time of recording of the AV stream corresponding to the Clip and is an encoded represen- 
tation of year/month/day/hour/minute in 4-bit binary coded decimal (BCD) for 14 numencal figures. For example, 

35 2001/2/23 01 02:03 is encoded to "0x20011223010203". 

r0197l The duration is a 24-bit field indicating the total playback time of the Clip by hour/m.nute/second based on 
arrival time clocks. This field is six numerical figures encoded in 4-bit binary coded decimal (BCD). For example, 01 : 
45 30 is encoded to "0x014530". 

r0198l A flag time controlled_flag indicates the recording mode of an AV stream file. If this t.me_controlled flag ,s 
40 1 it indicates that the recording mode is such a mode in which the file size is proportionate to the t.me elapsed smce 
recording, such that the condition shown by the following equation: 

Ts_average_rate* 1 92/1 88*(t-start_time)-a<=size_clip(t) 
45 <= TS_average_rate*1 92/1 88*(t- start_time) + a 

where TS average rate is an average bit rate of the transport stream of the AV stream file expressed by bytes/second. 
Zm In the above equation, t denotes the time in seconds, while startjime is the time po.nt when the first source 
50 packet of the AV stream file was recorded. The size_clip(t) is 10*192 bytes and a is a constant dependent on 

iSoor^timfcontrolled flag is set to 0, it indicates that the recording mode is not controlling so that the time lapse 
of recording is proportionate to the file size of the AV stream. For example, the input transport stream is recorded .n a 

55 lo?oT] a Tumrco n ntrolled_f.ag is set to 1 . the 24-bit field of TS_average_rate indicates the value of TS.average rate 
used n the above equation. If time.controlled.flag is set to 0, this field has no mean.ng and must be se to 0. For 
example the variable bit rate transport stream is encoded by the following sequence: First, the transport rate ,s set to 
The Sue of TS recording rate. The video stream is encoded with a variable bit rate. The transport packet .s mterm.t- 
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tently encoded by not employing null packets. 

[0202] The 32-bit field of RSPN_arrival_time_discontinuity is a relative address of a site where arrival timebase 
discontinuities are produced on the Bridge-Clip AV stream file. The RSPN_arrival_time_discontinuity is sized with the 
source packet number as a unit and is counted with the value of offset_SPN defined in the Cliplnfo() as from the first 
s source packet of the Bridge-Clip AV stream file. An absolute address in the Bridge-Clip AV stream file is calculated 
based on the aforementioned equation: 

SPN_xxx = RSPN_xxx - offset_SPN. 

w 

[0203] The 144-bit field of reserve r_for_system_use is reserved for a system. If is_format_identifier_valid flag is 1 , 
it indicates that the field of formatjdentifier is effective. If is JormatJdentifier_valid flag is 1 , it indicates that the 
formatjdentifier field is valid. If is_original_networkJD_valid flag is 1, it indicates that the field of 
is_transport_stream_ID-valid is valid. If the flag is_transport_stream_ID -valid is 1, it indicates that the 
15 transport_stream_ID field is valid. If is_servece_ID_valid flag is 1 , it indicates that the serveceJD field is valid. 

[0204] If is_country_code_valid flag is 1, it indicates that the field country_code is valid. The 32-bit field of 
formatjdentifier indicates the value of formatjdentifier owned by a registration descriptor (defined in ISO/I EC1 381 8-1 ) 
in the transport stream. The 16-bit field of original_networkJD indicates the value of the original_networkJD defined 
in the transport stream. 

20 [0205] The 16-bit field in serveceJD denotes the value of serveceJD defined in the transport stream. The 24-bit 
field of country_code shows a country code defined by IS03166. Each character code is encoded by IS08859-1 . For 
example, Japan is represented as "JPN" and is encoded to "0x4A 0x50 0x4E". The stream Jormat_name is 1 5 character 
codes of ISO-646 showing the name of a format organization affording stream definitions of transport streams. An 
invalid byte in this field has a value of 'OxFP. 

25 [0206] Formatjdentifier, original_network_ID, transport_streamJD, serveceJD, country_code and 
stream Jo rmat_n am e indicate service providers of transport streams. This allows to recognize encoding limitations on 
audio or video streams and stream definitions of private data streams other than audio video streams or SI (service 
information). These information can be used to check if the decoder is able to decode the stream. If such decoding is 
possible, the information may be used to initialize the decoder system before starting the decoding. 

30 [0207] STCJnfo is now explained. The time domain in the MPEG-2 transport stream not containing STC discontin- 
uous points(discontinuous points of the system time base) is termed the STC_sequence. In the Clip, STC_sequence 
is specified by the value of STC_sequenceJd. Figs.SOA and 50B illustrate a continuous STC domain. The same STC 
values never appear in the same STC_sequence, although the maximum tine length of Clip is limited, as explained 
subsequently. Therefore, the same PTS values also never appear in the same STC_sequence. If the AV stream contains 

35 N STC discontinuous points, where N > 0, the Clip system time base is split into (N+1) STC_sequences. 

[0208] STCJnfo stores the address of the site where STC discontinuities (system timebase discontinuities) are pro- 
duced. As explained with reference to Fig.51, the RSPN_STC_start indicates the address and begins at a time point 
of arrival of the source packet referenced by the (k+1)st RSPN_STC_start and ends at a time point of arrival of the 
last source packet. 

40 [0209] Fig.52 shows the syntax of the STCJnfo. Turning to the syntax of STCJnfo shown in Fig.52, verston_number 
is four character letters indicating the version number of STCJnfo(). The version_number must be encoded to ,, 0045 ,, 
in accordance with ISO 646. 

[0210] Length is a 32-bit unsigned integer indicating the number of bytes of STC_lnfo() from directly after this length 
field to the end of STCJnfo. If CPI Jype of CPI() indicates TU_map type, 0 may be set in this length field. If CPI Jype 

45 of CPI () indicates EP_map type, the num_of_STC_sequence mut be of a value not less than 1 . 

[0211] An 8-bit unsigned integer of num_of_STC_sequence indicates the number of sequences in the Clip. This 
value indicates the number of the for-loops next following the field. The STC_sequenceJd corresponding to the preset 
STC_sequence is defined by the order in which appears the RSPN_STC_start corresponding to the STC_sequence 
in the for-loop containing the RSPN_STC_start. The STC_sequence_id commences at 0. 

50 [0212] The 32-bit field of RSPN_STC_start indicates an address at which the STC_sequence commences on the 
AV stream file. RSPN_STC_start denotes an address where system time base discontinuities are produced in the AV 
stream file. The RSPN_STC_start may also be a relative address of the source packet having the first PCR of the new 
system time base in the AV stream. The RSPN_STC_start is of a size based on the source packet number and is 
counted from the first source packet of the AV stream file with the offset_SPN value defined in Cliplnfo() as an initial 

55 value. In this AV stream file, the absolute address is calculated by the above-mentioned equation, that is 

SPN_xxx= RSPN_xxx-offset_SPN. 
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set to 0. If the CPLtype of CPI() indicates EP.map type, the ™^ rt ^SX nlSS of program sequences in 
in the AV stream file is calculated by: 

SPN_xxx = RSPN_xxx - offset_SPN. 
sponding to vidno_str«,rn_PIO in P "f^'«j- ,„ „. synta> 0 , p rograI „,„lo shovn In Fig.54. Turning 10 the 
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cordance with ISO 646. Length is a 32-bit unsigned integer indicating the number of bytes as from directly after this 
length field to the trailing end of the CPI(). The CPLtype is a 1-bit flag and indicates the CPI type of Clip, as shown in 
Fig.66. 

[0228] The EP_map in the CPI syntax shown in Fig. 65 is explained. There are two types of the EP_map, that is 
5 EP_map for a video stream and an EP_map for an audio stream. The EP_map_type in the EP_map differentiates 
between these EP_map types. If the Clip contains one or more video streams, the EP_map for the video stream must 
be used. If the Clip does not contain a video stream but contains one or more audio streams, the EP_map for the audio 
stream must be used. 

[0229] The EP_map for a video stream is explained with reference to Fig. 67. The EP_jnap for the video stream has 
10 data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID shows the PID of the transport packet trans- 
mitting a video stream. The PTS_EP_start indicates the PTS of an access unit beginning from the sequence header 
of the video stream. The RSPN_RP_start indicates the address of a source packet including the first byte of the access 
unit referenced by the PTS_EP_start in the AV stream. 

[0230] A sub table, termed EP_map_for_one_stream_PlD() is created from one video stream transmitted by the 
15 transport packet having the same PID to another. If plural video streams exist in the Clip, the EP_map may contain 
plural EP_map_for_one_stream_PID(). 

[0231] The EP_map for audio stream has data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID 
shows a PID of a transport packet transmitting an audio stream. The PTS_EP_start shows the PTS of an accessing 
unit in the audio stream. The RSPN_EP-start indicates an address of a source packet containing a first byte of the 
20 access unit referenced by PTS_EP_start. of the AV stream. 

[0232] The sub table termed EP_map_for_one_stream_PID() is created from one audio stream transmitted by the 
transport packet having the same PID to another. If there exist plural audio streams in the Clip, EP_map may contain 
plural EP_map-_for_one_stream_PID(). 

[0233] Turning to the relation between EP_map and STCJnfo, one EP_map_for_one_stream_PlD() is created in 
25 one table irrespective of discontinuous points in the STC. Comparison of the value of the RSPN_EP_start to the value 
of RSPN_STC_start defined in STCJnfo() reveals the boundary of data of EP_map belonging to respective 
STC_sequences (see Fig. 68). The EP_map must have one EP_map_for_one_stream_PID for a continuous stream 
range transmitted by the same PID. In the case shown in Fig.69, program#1 and program#3 have the same video PID, 
however, the data range is not continuous, so that EP_map _for_one_stream_PlD must be provided for each program. 
30 [0234] Fig. 70 shows the EP_map syntax. By way of explanation of the EP_map syntax shown in Fig.70, the EP_type 
is a 4-bit field and shows the EP_map entry point type, as shown in Fig.71 . The EP_type shows the semantics of the 
data field next following this field. If Clip includes one or more video stream, the EP_type must be set to 0 ('video'). 
Alternatively, if the Clip contains no video stream but contains one or more audio stream, then EP_type must be set 
to 1 ('audio'). 

35 [0235] The 16-bit field of number_of_stream_PIDs indicates the number of times of loops of the for-loop having 
number_of_stream_PIDs in the EP_map() as a variable. The 16-bit field of stream_PID(k) indicates the PID of the 
transport packet transmitting the number k elementary stream (video or audio stream) referenced by 
EP_map_for_one_stream_PID (num_EP_entries(k)). If EP_type is 0 ('video'), its elementary stream must be a video 
stream. If EP_type is equal to 1 ('audio'), its elementary stream must be the audio stream. 

40 [0236] The 16-bit field of num_EP_entries(k) indicates the num_EP_entries(k) referenced by EP_map__entries(k)). 
The EP_map_for_one_stream_PID_Start_address(k): This 32-bit field indicates the relative address position at which 
the EP_map Jor_one_stream_PID(num_EP_entries(k)) begins in the EP_map(). This value is indicated by the size as 
from the first byte of the EP_map(). 

[0237] Padding_word must be inserted in accordance with the EP_map() syntax. X and Y must be optional positive 

45 integers. The respective padding words may assume any optional values. 

[0238] Fig. 72 shows the syntax of EP_map_for_one_stream_PID. By way of explanation of the syntax of the 
EP_map_for_one_stream_PlD shown in Fig.72, the semantics of the 32-bit field of PTS_EP_start differs with the 
EP_type defined by EP_map(). If EP_type is equal to 0 (Video'), this field has upper 32 bits of the 33-bit precision PTS 
of the access unit beginning with a sequence header of the video stream. If the EP_type is equal to 1 ('audio'), this 

50 field has upper 32 bits of PTS of 33 bit precision of the access unit of the audio stream. 

[0239] The semantics of the 32-bit field of RSPN_EP_start differs with the EP_type defined in EP_map(). If EP_type 
is equal to 0 ('video'), this field indicates the relative address of the source packet including the first byte of the sequence 
header of the access unit referenced by the PTS_EP_start in the AV stream. Alternatively, if EP_type is equal to 1 
('audio'), this field indicates the relative address of the source packet containing the first byte in the audio stream of 

55 the access unit referenced by the PTS_EP_start in the AV stream. 

[0240] RSPN_EP_start is of a size which is based on the source packet number as a unit, and is counted from the 
first source packet of the AV stream file, with the value of the offset_SPN : defined in Cliplnfo(), as an initial value. The 
absolute address in the AV stream file is calculated by 
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SPN_xxx = RSPN_xxx - offset_SPN. 



i ftho Dc P M fp start in the syntax must appear in the rising order. 

astom offset_time, this unit being termed tame ^unrt. ^ |r ^ fjrst comp | ete 

[0243] in each time_unit in the AV stream, addresses on the AV mmflta o t P ^ ^ & 

Sorm ire stored in TU_map. These addresses are ^^^J^^i. calcu .ated based on the foliowing 
0)th time.unit on the TU_map_t.me_ax.s is termed TU_start_time(K). 

equation: 

TU_start_time(k) = offset_time + k*time_unit_size. 

[02 44] It is noted that TU.start.timejk) , has aP^»n^ shown Rg ?4 the 32 . bit 

[0245] Fig.74 shows the syntax of TU_map. By way °^™ ,on Jhis v ; lue j ndicates the offset time relative to 
held of offset_time gives an offset tame t ^ * ^ clock derived from the 27 MHz precision 

the first time_unit in the Clip. The offset_t,me « of a s*e ^based on ^ ^ ^ tQ q 

arrival time clocks as unit. If the AV stream "» ^f fhe tie unit, and" is based on 45 kHz clocks, derived 

[0246] The 32-bit field of ^^ m t ^^J^^ t 7me unit size is not ionger than one second 

r e ^™ the number of entries stored in 

[o^he 32-bit fie,d o, BSN.time ^ — the ™^^^J^J^ 

?S absoiute addressTn the AV stream file is calculated by 

SPN_xxx = RSPN_xxx - offset_SPN. 

the number k RSPN_time_unit_start. Fig. 45, the ClipMark is the mark 

KXC-'^ SSS ™« /no, s« * ~ - - * » tooorta, 

(recording and/or reproducing apparatus 1). eX n|ana«on of the ClipMark syntax shown in Fig.75, the 

.„e„dsd in MC.rt.no. iw»J 0» «"J ^^SSha 6me slamp mm*, . poin.ar having a speoiiied .nark. Tha 
[M521 ManV.tima.stamp is a 3ZM nam i ana piavLiaO. as shov<n in Fig.77. 

snntantios o( mark_tlma_stamp d*e.s «nh _<= pl -^" ' ™ ™ £ ™„ lnltoM , lhe S TC_aequenoe Jd ol tha cortin- 
[02531 II CPLtypa in CPU) indicates tha EP_map lyp^mis >b held ^ ^ (|e|(J 

Lous STCdd m ain~rt f*^^^^™^^^, m-h- - chataote, lane,, 
has no meaning but is set to 0. Tne a dii iieiu rorresDO nds to the value shown in Fig.19. 

encoded in the mark.name field. The encoding^ ^^^JJJ, ^ name sh0 wn in the mark_name field. This 
[0254] The 8-bit fie.d of ^^^^ to «» nUmber ° f ^ na ™ J *T 

following these effective charact l^^^^om^Uon Q f the thumbnail picture appended to the mark If 

OxFFFF, a thumbnail picture is added to its ma.. w*h the 
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thumbnail picture being stored in the mark.thumb file. This picture is referenced using the value of ref_thumbnail_index 
in the mark.thumb file. If the reMhumbnaiUndex field is of a value equal to OxFFFF, a thumbnail picture is not appended 
to its mark. 

[0256] MakerPrivateData has already been explained with reference to Fig.22 and hence is not explained here spe- 
5 cifically. 

[0257] Next, thumbnaiLinformation is explained. A thumbnail picture is stored in a menu.thmb file or in a mark.thmb 
file. These files are of the same syntax structure and own a sole Thumbnail(). The menu.thmb file stores a picture 
representing respective PlatyLists. The totality of menu thumbnails are stored in the sole menu.thmb file. 
[0258] The mark.thmb file stores a mark thumbnail picture, that is a picture representing a mark point. The totality 

10 of mark thumbnails corresponding to the totality of PlayLists and Clips are stored in the sole mark.thmb file. Since the 
thumbnails are frequently added or deleted, the operation of addition and partial deletion must be executable readily 
and speedily. For this reason, Thmbnail() has a block structure. Picture data is divided into plural portions each of which 
is stored in one tn_b)ock. One picture data is stored in consecutive tn_blocks. In the string of tn_blocks, there may 
exist a tn_block not in use. The byte length of a sole thumbnail picture is variable. 

'5 [0259] Fig. 78 shows the syntax of menu.thmb and mark.thmb and Fig. 79 the syntax of Thumbnail in the syntax of 
menu.thmb and mark.thmb shown in Fig.78. By way of explanation of the syntax of Thumbnail, shown in Fig.79, 
version_number is four character letters denoting the version number of this Thumbnail(). Version_n umber must be 
encoded to "0045" in accordance with ISO 646. 

[0260] Length is a 32-bit unsigned integer indicating the number of bytes of MakerPrivateData() as from directly at 
20 back of the length field up to the trailing end of Thumbnail(). Tu_block_start_ad dress is a 32-bit unsigned integer 
indicating the leading end byte address of the first tn_block, in terms of the relative number of bytes as from the leading 
end byte of Thumbnail() as a unit. The number of relative bytes is counted from 0. Number_of_thumbnails is a 16-bit 
unsigned integer which gives the number of entries of a thumbnail picture contained in Thumbnail(). 
[0261] Tu_block_size is a 16-bit unsigned integer which gives the size of one tn_block, in terms of 1024 bytes as a 
25 unit. If, for example, tn_block_size = 1 , it indicates that the size of one tn_block is 1024 bytes. Number_of_tn_biocks 
is a 116-bit unsigned integer indicating the number of entries of tn_block in this ThumbnailQ. ThumbnaiLindex is a 
16-bit unsigned integer indicating the index number of the thumbnail picture represented by the thumbnail information 
for one for-loop beginning from the thumbnaiMndex field. The value OxFFFF must not be used as Thumbnail_index. 
This ThumbnaiLindex is referenced by reMhumbnaiUndex in UIApplnfoVolume(), UIApplnfoPlayl_ist(), PlayListMark 
30 () and ClipMark(). 

[0262] Thumbnail_picture_format is an 8-bit unsigned integer representing the picture format of a thumbnail picture 
and assumes a value shown in Fig. 80. In the table, DCF and PNG are allowed only in menu.thumb. The mark thumbnail 
must assume the value of "0x00" (MPEG-2 Video 1 -picture). 

[0263] Picture_data_size is a 32-bit unsigned integer indicating the byte length of a thumbnail picture in terms of 
35 bytes as a unit. Start_tn_block_number is a 16-bit unsigned integer indicating the tn_block number of the tn_block 
where data of the thumbnail picture begins. The leading end of the thumbnail picture data must coincide with the leading 
end of the tn_block. The tn_block number begins from 0 and is relevant to the value of a variable k in the for-loop of 
tn_block. 

[0264] X _picture_length is a 1 6-bit unsigned integer indicating the number of pixels in the horizontal direction of a 
40 frame of a thumbnail picture. Y_picture_length is a 1 6-bit unsigned integer indicating the number of pixels in the vertical 
direction of a frame of a thumbnail picture. Tn_block is an area in which to store a thumbnail picture. All tn_block in 
the ThumbnailO are of the same size (fixed length) and are of a size defined by tn_block_sixe. 

[0265] Figs. 81 A and 81 B schematically show how thumbnail picture data are stored in tn_block. If, as shown in Figs. 
81 A and 81 B, the thumbnail picture begins at the leading end of tn_block, and is of a size exceeding 1 tn_block : it is 
45 stored using the next following tn_block. By so doing, data with a variable length can be managed as fixed length data, 
so that the editing of deletion can be coped with by simpler processing. 

[0266] An AV stream file is now explained. The AV stream file is stored in the "M2TS" directory (Fig.14). There are 
two types of the AV stream file, namely a Clip A stream file and a Bridge-Clip AV stream file. Both AV streams must 
be of the structure of DVR MPEG-2 transport stream file as hereinafter defined. 

50 [0267] First, the DVR MPEG2 transport stream is explained. The structure of the DVR MPEG-2 transport stream is 
shown in Fig.82. The AV stream file has the structure of a DVR MPEG 2 transport stream. The DVR MPEG 2 transport 
stream is made up of an integer number of Aligned units. The size of the aligned unit is 6144 bytes (2048*3 bytes). 
The Aligned unit begins from the first byte of the source packet. The source packet is 192 bytes long. One source 
packet is comprised of TP_extra_header and a transport packet. TP_extra_header is 4 bytes long, with the transport 

55 packet being 1 88 bytes long. 

[0268] One Aligned unit is made up of 32 source packets. The last Aligned unit in the DVR M PEG 2 transport stream 
is also made up of 32 source packets. Therefore, the DVR MPEG 2 transport stream ends at a boundary of the Aligned 
unit. If the number of the transport packets of the input transport stream recorded on a disc is not a multiple of 32, a 
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source packet having a null packet (transport packet of PID = 0x1 FFFF) must be used as the last Aligned unit. The file 
system must not use excess information in the DVR MPEG 2 transport stream. 

[0269] Fig. 83 shows a recorder model of the DVR MPEG 2 transport stream. The recorder shown in Fig. 83 is a 
conceptual model for prescribing the recording process. The DVR MPEG 2 transport stream obeys this model. 

5 [0270] The input timing of the MPEG 2 transport stream is now explained. The input MPEG 2 transport stream is a 
full transport stream or a partial transport stream. The input MPEG 2 transport stream must obey the ISO/IEC1381 8-1 
or ISO/I EC 13818-9. The number i byte of the MPEG 2 transport stream is input simultaneously at time t(i) to T-STD 
(transport stream system target decoder provided for in SO/IEC13818-1) and to the source packetizer. Rpk is an in- 
stantaneous maximum value of the input rate of the transport packet. 

10 [0271] A 27 MHz PLL 52 generates a frequency of 27 MHz clock. The 27 MHz clock frequency is locked at a value 
of the program clock reference (PCR) of the MPEG 2 transport stream. An arrival time clock counter 53 counts the 
pulses of the 27 MHz frequency. Arrival_time_clock(i) is a count value of the arrival time clock counter at time t(i). 
[0272] A source packetizer 54 appends TP_extra_header to the totality of the transport packets to create a source 
packet. Arrival_time_stamp indicates the time when the first byte of the transport packet reaches both the T-STD and 

*s the source packetizer, Arrival_time_stamp(k) is a sampled value of the Arrival_time_clock(k) as represented by the 
following equation: 

arrival_time_stamp(k) = arrival_time_clock(k)% 230 

20 

where k denotes the first byte of the transport packet. 

[0273] If the time separation between two neighboring transport packets is 230/2 7000000 sec (about 40 sec) or 
longer, the difference of the arrival_time_stamp of the two transport packets should be set to 230/2 7000000 sec. The 
recorder is provided for such case. 
25 [0274] A smoothing buffer 55 smoothes the bitrate of the input transport stream. The smoothing buffer must not 
overflow. Rmax is the output bitrate of the source packet from the smoothing buffer when the smoothing buffer is not 
null. If the smoothing buffer is null, the output bitrate of the smoothing buffer is 0. 

[0275] Next, the parameters of the recorder model of the DVR MPEG 2 transport stream are explained. The value 
of Rmax is given by TS_recording_rate as defined in Cliplnfo() associated with the AV stream file. This value may be 
30 calculated from the following equation: 

Rmax = TS_recording_rate*1 92/1 88 

35 where the value of TS_recording_rate is of a size in bytes/second. 

[0276] If the input transport stream is an SESF transport stream, Rpk must be equal to TS_recording_rate as defined 
in CliplnfoQ associated with the AV stream file. If the input transport stream is not an SESF transport stream, reference 
may be made to values defined e.g., in a descriptor of the MPEG 2 transport stream, such as 
maximum_bitrate_descriptor or partial_stream_descriptor for this value. 

40 [0277] If the input transport stream is an SESF transport stream, the smoothing buffer size is 0. If the input transport 
stream is not an SESF transport stream, reference may be made to values defined in the descriptor of the MPEG 2 
transport stream, such as, for example, the values defined in the smoothing^uffer^descriptor, short_smoothing 
_buffer_descriptor or in the partial_transport_stream_descriptor. 

[0278] For the recorder and the player (reproducing apparatus), a sufficient size buffer needs to be provided. The 
45 default buffer size is 1536 bytes. 

[0279] Next, a player model of the DVR MPEG 2 transport stream is explained. Fig. 84 shows a player model of the 
DVR MPEG 2 transport stream. This is a conceptual model for prescribing the reproduction process. The DVR MPEG 
2 transport stream obeys this model. 

[0280] A 27 MHz X-tal 61 generates the frequency of 27 MHz. An error range of the 27MHx frequency must be ±30 
so ppm (2 7000000 ±810 Hz). The arrival time clock counter 62 is a binary counter for counting the pulses of the frequency 
of 27 MHz. Arrival_time_clock(i) is a count value of the arrival time clock counter at time t(i). 

[0281] In the smoothing buffer 64, Rmax is the input bitrate of the source packet to the smoothing buffer when the 
smoothing buffer is not full. If the smoothing buffer is full, the input bitrate to the smoothing buffer is 0. 
[0282] By way of explaining the output timing of the MPEG 2 transport stream, rf the arrival_time_stamp of the current 
55 source packet is equal to 30 bits on the LSB side of arrival_time_clock(i), the transport packet of the source packet is 
removed from the smoothing buffer. Rpk is an instantaneous maximum value of the transport packet rate. The overflow 
of the smoothing buffer is not allowed. 

[0283] The parameters of the player model of the DVR MPEG 2 transport stream are the same as those of the 
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rpcorder model of the DVR MPEG 2 transport stream described above. 

0^841 Fig 85 shows the syntax of the source packet. Transport_packet() is an MPEG transport stream .provided in 
SoTeC MMM The syntax of TP.Extra-header in the syntax of the source packet shown in Fig *5 ,s shown m fig. 
86 Bv way o explaining the syntax of the TP_Extra-header, shown in Fig.86, copy ^ermission_.nd,cator is an integer 
fepr!sen7ng the copying limitation of the payload of the transport packet. The copying limitation may be copy free no 
mole copy copy once or copying prohibited. Fig.87 shows the relation between the value of copy_perm.ss.on_,nd.cator 

WW ZVJeS^Jn^ror is appended to the totality of transport packets If the input transpo^ 
r^oTded usinc Tthe IEEE1394 digital interface, the value of copy_permission_.nd.cator may be assoc.at edwAh the 
JZ o EN (encryption mode indicator). If the input transport stream is recorded without employ ng the .EEE1394 
dfoital interface the value of copy_permission_indicator may be associated with the value of the CCI embedded ,n he 
Sspori pa Set "f an analog *r± input is self-encoded, the value of copy^errnission.indicator may be assorted 

arrival_time_stamp(k) = arrival_time_clock(k)%230. 

[0287] By way of defining the ClipAV stream, the ClipAV stream must have a structure of ^ D VR A MP J^ 
stream deLd as described above. Arrival_time_clock(i)must increase contmuously .n the C hp AV stream, f the e 
exisi a continuous point of the system time base (STC base) in the Clip AV stream, arr,vaLt,me_clock(i) .n the Cl,p 

m 2 Sr Th" m— v^Shedifferent of the arrivaLtime_c.ock(i) between the beginning and the end of the Clip 
AV SLm muT ^26 Thours. This limrtation guarantees that, if there is no discontinuous point in the system t.me base 
SroSJSmTMPM 2 transport stream, the PTS (presentation time stamp) of the same value never appears ,n 
thJ Clip AV stream The MPEG 2 system standard provides that the PTS has a wraparound period of 233/90000 sec 

mm ^By wa'of defining the Bridge-Clip AV stream, the Bridge-Clip AV stream must have a structure of the DVR 
MPEG 2 transport stream defined as described above. The Bridge-Clip AV stream must include a discontinuous pent 
of one aS time base. The transport stream ahead and at back of the discontinuous point of the arnva. t,me base 
must obev the encoding limitations and the DVR-STD as later explained. 

So] The present embodiment supports the video-audio seamless connection setween T^™^ "^Ss 
Seam ess connection between Playltems guarantees "continuous data supply" to the player/decoder and seamless 
^r p ^2.V. The "continuous data supply" is the capability of guaranteeing data supply to the decoder at a 
Se necessa^ to prevent buffer underflow. In order to enable data to be read out from the disc as data real-t.me 
orooert^ aS ^assured data is to be stored in terms of a continuous block of a sufficiently large s.ze as a unit. 
K^TSli decoding processing" means the capability of a player in displaying audio v.deo data recorded 
on the disc without producing pause or gap in the playback output of the decoder. 

ra2921 ^AV sueam, referenced by the seamless connected Playltems, is explained. Whether or not the seam ess 
disolav of a obvious Playltem and the current Play.tem is guaranteed may be verified from the connect,on_cond,t,on 
S c^ned'ir, Ze cu\Zt Playltem. There are two methods for seamless connection of Playltems, that ,s a method 
emDlovinq Bridqe-Clip and a method not employing Bridge-Clip. amn i n ^, nn 
02931 Fig 88 shows the relation between the previous Playltem and the current Hayltem ,n case o ^mptoying 
Bndqiciip in Fig.88, the stream data, read out by the player, is shown shaded, .n Rg.88 TS1 is made up of shaded 
stream data of *e CI i P 1 (Clip AV stream) and shaded stream data previous to RSPN_amvaLt,me_d.scont.nu,ty. 
ST The Ihaded s ream data of Clipf of TS1 is stream data from an address of a stream required for decoding 
IneZe^tifn^ 

nacket referenced by RSPN exit from_previous_Clip. The shaded stream data prior xo 
RSPN zr^TZe discontinuity of Bridge^ contained in TS1 is stream data as from the first source packet of 
so bSSSTT to the source packet directly previous to the source packet referenced by 

SSJ^^S^S^SSTip of shaded stream data of Clip 2 (Clip AV stream) and shaded stream data subse- 
quern S^RSPN arrival time Jiscontinuity of Bridge-Clip. The shaded stream data as from the 
RSPN arriva" time discontinuity of Bridge-Clip contained in TS2 stream data from the source packet referenced by 
55 reP^^S^^nul? to the last source packet of Bridge-Clip. The shaded stream data of C.,p2 of TS2 ,s 
^SSKS packet referenced by RSPN_enter_to_current_C.ip to the address of the stream required 

Z deoodTrfq he presentation unit corresponding to OUT.time of current Playltem (shown by OUT_t.me2 in Rg.88). 
0296^9 89 shows tS relation between the previous Play.tem and the current P.ay.tem in case of not employ.ng 
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Bridge-Clip. In this case, the stream data read out by the player is shown shaded. In Fig.89,TS1 is made up of shaded 
stream data of the Clipl (Clip AV stream). The shaded stream data of Clipl of TS1 is data beginning at an address of 
a stream necessary in decoding a presentation unit corresponding to IN_time of the previous Playltem, shown at 
IN_time1 in Fig.89 as far as the last source packet of Clipl . 
5 [0297] In Fig.89, TS2 is shaded stream data of Clip2 (Clip AV stream). 

[0298] The shaded stream data of Clip2 of TS2 is stream data beginning at a first source packet of Clip2 as far as 
an address of the stream necessary for decoding the presentation unit corresponding to OUT_time of current Playltem 
(shown at OUT_time2 in Fig.89). 

[0299] In Figs.88 and 89, TS1 and T2 are continuous streams of the source packet. Next, the stream provisions of 
10 TS1 and TS2 and the connection conditions therebetween are scrutinized. First, encoding limitations for seamless 
connection are scrutinized. By way of limitations on the encoding structure of a transport stream, the number of pro- 
grams contained in TS1 and TS2 must be 1 . The number of video streams contained in TS1 and TS2 must be 1 . The 
number of audio streams contained in TS and TS2 must be 2 or less. The numbers of the audio streams contained in 
TS1 and TS2 must be equal to each other. It is also possible for elementary streams or private streams other than 
15 those depicted above to be contained in TS1 and/or TS2. 

[0300] The limitations on the video bitstream are now explained. Fig. 90 shows a typical seamless connection indi- 
cated by a picture display sequence. In order for a video stream to be demonstrated seamlessly in the vicinity of a 
junction point, unneeded pictures displayed at back of OUT_time1 (OUT_time of Clipl ) and ahead of IN_time2 (INJime 
of Clip2) must be removed by a process of re-encoding the partial stream of the Clip in the vicinity of the junction point. 
20 [0301] Fig.91 shows an embodiment of realizing seamless connection using BridgeSequence. The video stream of 
Bridge-Clip previous to RSPN_arrival_time_discontinuity is comprised of an encoded video stream up to a picture 
corresponding to OUTJimel of Clipl of Fig.90. This video stream is connected to the video stream of previous Clipl 
and is re-encoded to form an elementary stream conforming to the MPEG2 standard. 

[0302] The video stream of Bridge-Clip subsequent to RSPN_arrival_time_discontinuity is made up of an encoded 
25 video stream subsequent to a picture corresponding to IN_time2 of Clip2 of Fig.90. The decoding of this video stream 
can be started correctly for connecting the video stream to the next following Clip2 video stream. Re-encoding is made 
such that a sole continuous elementary stream conforming to M PEG 2 standard will be formed. For creating Bridge-Clip, 
several pictures in general need to be re-encoded, whilst other pictures can be copied from the original Clip. 
[0303] Fig. 92 shows an embodiment of realizing seamless connection without employing BridgeSequence in the 
30 embodiment shown in Fig.90. The Clipl video stream is comprised of an encoded video stream as far as the picture 
corresponding to OUTJimel of Fig.90 and is re-encoded so as to give an elementary stream conforming to the MPEG2 
standard. In similar manner, the video stream of Clip2 is made up of encoded bitstreams subsequent to the picture 
associated with INJime2 of Clip2 of Fig.90. These encoding bitstreams are already re-encoded to give a sole contin- 
uous elementary stream conforming to the MPEG2 standard. 
35 [0304] By way of explaining encoding limitations of the video stream, the frame rates of the video streams of TS1 
and TS2 must be equal to each other. The video stream of TS1 must be terminated at sequence_end_code. The video 
stream of TS2 must commence at Sequence header, GOP Header and with an l-picture. The video stream of TS2 
must commence at a closed GOP 

[0305] The video presentation units defined in a bitstream (frame or field) must be continuous with a junction point 
40 in-between. No gap of the fields or frames are allowed to exist at junction points. In case of encoding employing 3-2 
pulldown, it may be necessary to rewrite "top_field_first" and "repeatJirsWield" flags. Alternatively, local re-encoding 
may be made to prevent field gaps from being produced. 

[0306] By way of explaining encoding limitations on the audio bitstream, the audio sampling frequency of TS1 and 
that of TS2 must be equal to each other. The audio encoding method of TS1 and that of TS2 (for example, MPEG1 

45 layer 2, AC-3, SESF LPCM and AAC) must be equal to each other. 

[0307] By way of explaining encoding limitations on MPEG-2 transport stream, the last audio frame of the audio 
stream of TS1 must contain audio samples having a display timing equal to the display end time of the last display 
picture of TS1 . The first audio frame of the audio stream of TS2 must contain an audio sample having a display timing 
equal to the display start timing of the first display picture of TS2. 

50 [0308] At a junction point, no gap may be allowed to exist in a sequence of the audio presentation units. As shown 
in Fig.93, there may be an overlap defined by the length of the audio presentation unit less than two audio frame 
domains. The first packet transmitting an elementary stream of TS2 must be a video packet. The transport stream at 
the junction point must obey the DVR-STD which will be explained subsequently. 

[0309] By way of explaining limitations on the Clip and Bridge-Clip, no discontinuities in the arrival time base are 
55 allowed to exist in TS1 or in TS2. 

[0310] The following limitations are applied only to the case of employing the Bridge-Clip. The Bridge-Clip AV stream 
has a sole discontinuous point in the arrival time base only at a junction point of the last source packet of TS1 and the 
first source packet of TS2. The SPN_arrrval_time_discontinuity defined in CliplnfoQ represents an address of the dis- 
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Pin si must represent the display end time of the last video presentat.on un.t of TS1 . The IN_time of the current 
thP riio2 (Clio AV stream file) so that the provisions on data allocation will be met. 

03 ?6 r T SeW stream port on of C.ipl (Clip AV stream file) must be arranged in a continuous area not less than one 
[031 61 Tne last stream k v i- stream file) must be arranged in a continuous area not less 

half fragment. The first [J^^^aES ^ dvr-sS is a conceptuaTmode. for modeling the decoding 

v, a .o s.reaa, „. Pn« ^^^STT- DVR MPEG 2 mnapo-t a.-ean, is 
is „., Z«L as ,ha »a axis a, .h. a™a, «n»l ~ *J^^»,™ I £5, is a., ft. sam* as ,ha 

*.wL, ™ -^^0? sas.« is - »» « 

52U2 Kl™ KS2iSl:55 - » —» - ™ ■ ™> -™- - ° VBSTD 15 

by arrivaLtime.stamp of the arrival time base of TS1 . 
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[0325] The remaining packets of TS1 must be input to buffers of TBn or to TBsys of DVR-STD at a bitrate of 
TS_recording_rate (TS 1 ) . The TS_recording_rate (TS1 ) is the value of TS_recording_rate defined in Cliplnfo() corre- 
sponding to Clipl . The time the last byte of TS1 is input to the buffer is the time T2. So, during the time between time 
T1 and time T2, arrivaLtime_stamp of the source packet is discounted. 
5 [0326] If N1 is the number of bytes of the transport packet of TS1 next following the last video packet of TS1 , the 
time DT1 from time T1 until time T2 is the time necessary for N1 bytes to be input completely at a bitrate of 
TS_recording_rate (TS1), and is calculated in accordance with the following equation: 

1Q DT1 = T2 - T1 = N1/TS_recording_rate. 

[0327] During the time from time T1 until time T2 (TS1), both the values of RXn and RXsys are changed to the value 
of TS-recording_rate (TS1). Except this rule, the buffering operation is the same as that of T-STD. 
[0328] At time T2, the arrival time clock counter is reset to the value of arrival_time_stamp of the first source packet 
15 of TS2. The input timing to the buffer of TB1 , TBn or TBsys of DVR-STD is determined by arrival_time_stamp of the 
source packet of TB2. Both RXn and RXsys are changed to values defined in T-STD. 

[0329] By way of explaining additional audio buffering and system data buffering, the audio decoder and the system 
decoder need to have an additional buffering amount (data amount equivalent to one second) in addition to the buffer 
amount defined in T-STD in order to allow input data of a domain from time T1 to time T2. 
20 [0330] By way of explaining the video presentation timing, the display on the video presentation unit must be con- 
tinuous, that is devoid of gaps, through junction point. It is noted that STC1 is the time axis of the system time base of 
TS1 (indicated as STC1 in Fig. 9), while STC2 is the time axis of the system time base of TS2 (shown at STC2 in Fig. 
97; correctly STC2 begins at time the first PCR of TS2 has been input to the T-STD). 

[0331] The offset between STC1 and STC2 is determined as follows: the PTS1 end is the PTSonSTCI corresponding 
25 to the last video presentation unit of TS2. PTS2start is PTS on STC2 corresponding to the first video presentation unit 
of TS2 and Tpp is the display time period of the last video presentation unit of TS1 , the offset STC_delta between two 
system time bases is calculated in accordance with the following equation: 

STC delta = PTSIend + Tpp - PTS2start. 

30 ~ 

[0332] By way of explanation of the audio presentation timing, there may be overlap in the display timing of the audio 
presentation unit, with the overlap being less than 0 to 2 audio frames (see "audio overlap" shown in Fig. 97). The 
indication as to which of audio samples is to be selected and re-synchronization of the display of the audio presentation 
35 unit to the corrected time base at back of the junction point are set on the player. 

[0333] By way of explaining the system time clock of DVR-STD, the last audio presentation unit of TS1 is displayed 
at time T5. The system time clock may be overlapped between time T2 and time T5. During this time domain, the 
DVR-STD switches the system time clocks between the value of the old time base (STC1) and the value of the new 
time base (STC 2). The value of STC2 may be calculated in accordance with the following equation: 

40 

STC2 = STC1 - STC_delta. 

[0334] The buffering continuity is explained. STC11 video_end is the value of STC on the system time base STC2 
45 when the first byte of the first video packet reaches TB1 of DVR-STD. STC22video_start is the value of STC on the 
system time base STC2 when the first byte of the first video packet reaches TB1 of DVR-STD. STC21 video_end is 
the value of STC11 video_end calculated as the value on STC2 of the system time base STC2. STC21 video_end is 
calculated in accordance with the following equation: 

50 STC21video_end = STC1 1 video_end - STC_delta. 

[0335] In order to obey DVR-STD, the following two conditions must be met: First, the arrival timing of the first video 
packet of TS2 at TB1 must satisfy the following inequality: 

55 

STC22video_start > STC21 video_end + AT1 . 
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[0336] If it is necessary to re-encode and/or multiplex the partial stream of Clipl and/or Clip2, in such a manner that 
the above inequality will be satisfied, this re-encoding or multiplexing is performed as appropriate. 
[0337] Second, the inputting of the video packet from TS1 followed by the inputting of the video packet from TS2 on 
the time axis of the system time base, mapped from STC1 and STC2 on the same time axis, must not overflow or 

5 underflow the video buffer. 

[0338] If the above syntax, data structure and the rules are used as basis, the contents of data recorded on the 
recording medium or the reproduction information can be managed properly to enable the user to confirm the contents 
of data recorded on the recording medium at the time of reproduction or to reproduce desired data extremely readily. 
[0339] Next, detailed description will be given on recording of an AV stream file in a case where time_controlled_flag 

10 in the syntax of Cliplnfo shown in Fig.46 is set to be 1 . In a case where time_controlled_flag is set to be 1 , time lapse 
of the AV stream and the volume of the data bytes of the AV stream have the following relationship. That is , it is assured 
that the time lapse will be proportionate to the volume of the data bytes of the AV stream within a preset error range. 

15 TS_average_rate*1 92/1 88 * (t-a)<= AV_f ile_size(t) 

<= TS_average_rate*1 92/1 88 * (t+a) equation (1 ) 

[0340] The above equation is substantially the same as one shown in the description of time_controlled_f!ag of 

20 Cliplnfo in Fig.46 although its form is slightly different. 

[0341] In the equation, TS_average_rate is an average bit rate of the AV stream file (DVR transport stream file) 
shown by a unit of bytes/seconds, and is indicated by a field of the same name in the Cliplnfo. Also, t is lapse time of 
the arrival time base from the first source packet of the AV stream file shown by a unit of seconds. AV_file_size(t) is 
the size of the AV stream file at time t shown by a unit of bytes, a is a predetermined fixed value, and e.g., 300 seconds. 

25 [0342] The value of TS_average_rate is arbitrarily determined depending on the application of the recorder. For 
example, the values of TS_average_rate are respectively determined for the respective modes in accordance with the 
recording modes such as a long-time picture recording mode (LP mode), a standard picture recording mode (SP mode), 
and a high quality picture recording mode (HQ mode). 

[0343] In a case where the AV stream file is recorded to satisfy the equation (1 ), if the stream is partially erased for 
30 a certain time of the stream, it can be assured to generate on the disc an unoccupied area recordable for a time equal 
to the erased time at a bitrate indicated by the TS_average_rate of the stream in question. For example, if the stream 
is partially erased for a certain time of the AV stream in the SP mode, it is possible to generate on the disc an unoccupied 
area recordable for a time equal to the erased time in the same SP mode. 

[0344] Fig. 98 is a block diagram for illustrating the operation of an AV encoder 1 5 of the recording and/or reproducing 
35 apparatus 1 of Fig.1 in case the variable bit rate is controlled so that the time lapse will be proportionate to the volume 
of the data bytes of the AV stream within a preset error range. The blocks depicted with the same numerals in Fig. 98 
are the same as those in Fig. 1 . 

[0345] First, recording modes, such as LP or SP modes, are input through a user interface 24 to the controller 23. 
The controller 23 is responsive to the recording mode to set average bit rates of video encoding and multiplexing bitrate 

^0 of the AV stream (DVR transport stream) to be recorded (step S20 of the flowchart of Fig. 99). 

[0346] The controller 23 sets time_controlled_flag to 1 to set the average bitrate and the multiplexing bitrate of the 
multiplexed stream to TS_average_rate and TS_recording_rate, respectively. The controller 23 outputs a database of 
the Clip information file in the Cliplnfo of which are set time_controlled_flag s TS_recording_rate and TS_average_rate. 
The Clip Information file is recorded through the processing of the ECC encoding unit 20 explained in Fig.1 . 

45 [0347] In encoding an analog video input, video is input from terminal 11 . In transcoding a video input of the digital 
broadcast, video from the AV decoder 27 is input. The input video is input to a video encoder 151 . The controller 23 
calculates the amount of encoding bits allocated to video per preset time to designate it for the video encoder. The 
video encoder 115 encodes video per preset time to input the amount of bits actually generated to the controller 23. 
The preset time size is the video GOP and is equal to 0.5 sec. Based on the cumulative value as from the start of 

50 encoding of the amount of actually generated encoding bits input from the encoder, the controller 23 controls the 
variable bitrate of video encoding, so that the time lapse of the AV stream will be proportionate to the data byte volume 
of the AV stream within the range of a preset error to calculate the amount of the encoded bits for the video for the 
next preset time. If the controller 23 can be furnished with the relative video encoding difficulty (such as magnitude of 
prediction residues of motion vector prediction or the magnitude of the quantization scale of DCT coefficients) form 

55 the encoder, a variable bitrate can be realized to a higher picture quality. That is, control is made so that the higher the 
relative video encoding difficulty, the larger becomes the amount of allocated encoding bits to the video per preset time. 
[0348] The video encoder 115 inputs a video stream to a multiplexer 16 which is also fed with an audio stream and 
with the system information (S), such as AV synchronization. The encoding processing flow for an audio input or the 
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sy „em intormation (S) .act. as AV synch™**™ Is £ = " t^SSS^l <?. p~« r-ipfcttiag »«rate. 
[0M91 The multiplexer 1 6 multiplexes »«3eo and audio f"^'"™ 5 ^",,^ larget rfcooder (T- STD) ol the 
„ paeketixlop .idea aad audio, comrol mus be made 80 aa ~ X^rldeo acceaa unit (piomree ol 
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stream TS_recording_rate. _ uprBflfi rate less the constant bitrate of audio encoding 

SLS^SS^ — — — on the 

S?' *f r^ng^s" tobetrger than the maximum bitrate of video encoding at a variab.e borate plus the 
constant bitrate for audio encoding plus the multiplexing °™^™sTso that a vide stream will be encoded at a 

s,a ^.^rss.is^ss " — - - — ^ *° 

which then is recorded. WArlf j^ r \ rtf mpfg in case of variable bitrate encoding of 

[0358] The method of controlling the VBV (v.deo buffering ve^ofMPEG, ™"™00) lt „ up t0 the MPEG 
video It is noted that VBV is a theoretical decoder model prescriDeo uyi r v m(=>thod 

(mainly quan- 

encoder to encode the video stream so that VBV will operate correctly ^^^~2Sd a VBV buffer This 
tization control and limitations on the amount of b,ts for * ^^-^S^^^HTmPECK main profi.e 
represents the theoretically necessary mm.mum buffer s.ze for a real decoder. 

main level, the VBV buffer size is 1 .75 Mbits. That R 1Q1 

l0 35 9] As for VBV of MPEG£ - J ^^iT^nJ m -Sm!li bitrate if the variab.e bitrate in the 
illustrates VBV control in case the input bitrate to the > duik ™« ( the VBV buffer is fu „. 

VBV buffer is not full, with the input bitrate to the buffer * ^ ™^ » the VBV buffer is not full, 

in Fig. 101 , the tilt of a rightward ascending line represents the ma*^^rn bitrate o VBR. 

the buffer takeup amount is increased at the maximum bitrate o VBV On * ot er^ Ranged. The 

in the VBV buffer is full, the input bitrate to ^J^^^^^^^^^ is instantly 
abscissa denotes the time axis, T1 md.cat.ng *^^™J^ u "^l^*£ is dec oded each preset time interval 

buffer. lilh nthor nr nnt the stuffina bvte is to be inserted and in calculating 

. VBV_BUFFER_SIZE = 1 .75*1024*1024 bits 

gop_bit_rate: bitrate per GOP [bit/second]. 
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(1) Calculations of the minimum amount of bits of a picture being encoded 

[0362] Take a picture at time d1 of Fig. 102 as an example. First, the bit occupying amount vbv_b of the VBV buffer 

immediately before decoding by the VBV of the picture at time d1 is obtained. Then, a sum tmp corresponding to the 
5 bit occupying amount vbv_d plus the amount of bits input at a bitrate gop_bit_rate as from time d1 until the decoding 

time d2 of the next picture (tau) is calculated. The minimum amount of bits of the picture currently encoded may be 

found from tmp and 

VBV_BUFFEFLSIZE as follows: 

tmp=vbv_b+gop_bit_rate*tau 
10 min_picture_bit = tmp - VBV_BUFFER_SIZE. 

(2) Check as to whether or not byte stuffing is necessary following picture encoding 

[0363] If the actual encoding bit gen_picture_bit of the current picture is smaller than min_picture_bit, stuffing bytes 
15 of a size indicated by the following formula are produced. After the picture currently being encoded, the video encoder 
encodes a number of stuffing bytes equal to the number of num_stuffing_byte. One stuffing byte is an 8-bit code "0000 
0000". 

if (gen_picture_bit < min_picture_bit) 

nunrr stuff ing_byte = (min_picture_bit- gen_picture_bit + 4)/8. 

20 [0364] In the method shown in Fig. 102, it is targeted that control is made so that the video encoder uses the amount 
of bits allocated to the video of a preset time. The input bitrate to the VBV buffer is the encoding bitrate of the current 
GOP, with the video encoder then generating stuffing bytes so as not to produce overflow in the VBV buffer. 
[0365] The VBV control is effective in order that the relation between the time lapse of the AV stream and the amount 
of bytes in the AV stream will be linear within a preset error range, as shown in Fig. 1 03, according to the concept of 

25 the present invention. If the VBV control shown in Fig. 101 is used, the relation shown in Fig. 1 03 cannot be guaranteed 
in the presence of a still picture continuing for long time in the input video. That is, the still picture has a smaller amount 
of the information, so that, if the amount of allocated bits for encoding is made larger than the amount of information, 
the amount of bits generated on actual encoding is saturated at a smaller value. Thus, in such case, the relation 
between time lapse of the AV stream and the amount of data bytes of the AV stream is not linear, as shown in Fig.1 04. 

30 |f, in such case, VBV control shown in Fig. 1 02 is used, the relation between time lapse of the AV stream and the amount 
of data bytes of the AV stream can positively be linear within a preset error range because the video encoder aimed 
to manage control to use the amount of bits allocated to video of a preset time generates stuffing bytes so that the 
input bitrate to the VBV buffer will be the encoding bitrate of the current GOP and so that no overflow will be produced 
in the VBV buffer. 

35 [0366] In the case of Fig. 104, if the AV stream of the time portion of the still picture is erased, it is not possible to 
produce an unoccupied area recordable for a time corresponding to the erased time portion with a bitrate indicated by 
the TS_average_rate of the stream on the disc, because the amount of data bytes of the time portion is smaller than 
the data size corresponding to the average bitrate multiplied with the erasure time. On the other hand, if the stream is 
partially erased for a certain time of the AV stream, it is possible to generate on the disc an unoccupied area recordable 

40 for a time equal to the erased time at a bitrate indicated by the TS_average_rate of the stream in question. 

[0367] Fig. 105 shows a flowchart for illustrating the detailed processing of variable bitrate control of video in the 
processing at step S21 of Fig.99 described above. 

[0368] At step S200, an initial value SV1 is set in an allowance value sv_now of VBR. According to the present 
invention, variable bitrate control is managed so that for guaranteeing that the relation between the time lapse of the 
45 AV stream and the amount of data bytes of the AV stream will be linear within a preset error range, the allowance 
amount sv_now of VBR will be in a range between 0 and the maximum value SVMAX. 
[0369] If, in the above equation (1), a = 300 sec, SV1 and SVMAX are of the following values: 
SV1 = (average bitrate of video)*300 
SVMAX = SV1*2 

50 where the average bitrate of video is of a value as determined at step S20 of Fig.99 (see Fig.1 07). 
[0370] At step S201 , the allocated amount of bits of encoding of the current GOP is calculated. 
[0371] At step S202, it is checked whether or not the following inequality: 
sv_now+b_av-b_alloc < 0 

holds. This check is performed to verify whether or not the allowance value of VBR is negative (-). 
55 [0372] In the above inequality, b_av is an average value of the amount of allocated encoding bits per GOP, as found 
from the average encoding bitrate of video. With the GOP time duration of 0.5 sec, b_av is of the following value: 
b_av = (average bitrate of video)*0.5. 

[0373] If the result of check at step S202 is YES, processing transfers to step S203. If the result of check at step 
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S202 is NO processing transfers to step S204, where b.alloc is set to b_av. Processing then transfers to step S205. 
[0374] At step S203, it is checked whether or not the following inequality holds: 

££f «■»» - vbr --^ ls upd8Md ln a "° rdance ™" h ,he ,0 "° , " n9 

mlsOl At step S300, the amount of encoding bits allocated to the current GOP is converted to the encoding borate 
gop_bit_rate in accordance with the following equation: 

^r^i^^^^ * bits ° f the pi ° ture being encoded in the current gop min - picture - bit 

is found by the following equation: 
tmp = vbv_b + gop_bit_rate*tau 

SwTSiS'iSS SSS. occupying amen, o. me VBV bo«et di.ee,,, ptavious ,o me decoding 
£5?^ SESTEtE ZSL'SiSL me decoding « o, me P« cunantiy encoded end me 
decoding «me of the M ' <P~ F « z 1 |»' VBV 6u(fe( sl2e anU , ,„ lne c „ e of MPEG2 MP@ML. is 1 .75 Mbits. 
53 - — - - 9enW " ea MS gen_P*cture_Wt. 

[0386] At step S303, the following inequality: 
gen_picture_bit < min_picture_bit 

the result of check at step S303 is YES, processing transfers to step S304. If the resuK of check at step 
S303 is NO, processing transfers to step S305. corresponding to the number of 

picture (see Fig.102): 

num_stuffing_byte = (min_picture_bit-gen_p.cture_brt + 4)/8 check 
So, By controlling the variable bitrata encoding d"ba v '^ e ^f^^^^j!^^'^^ m'd^vTOWd"^^^^ by^es^oTtne 

(time_controlled_flag = 0), are now shown transparently recording a transport stream of the AV 

JU^^n^rrS^rcSt^r^P-- fedOtding o, «. AV adeam, 
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[0394] Fig. 108 shows a flowchart for explaining the operation of encoding the video at a variable bitrate to record 
the AV stream in an encoding mode not guaranteeing the proportional relation between time lapse of the AV stream 
and the amount of data bytes of the AV stream. 

[0395] The flowchart is the same as that of Fig. 99 except step S400. 
5 [0396] At step S400, the video encoder 1 51 is controlled so that the video stream will be encoded at a variable bitrate 
in such a manner that the bitrate will be lower than a preset average bitrate from one preset time domain to another. 
[0397] Fig. 109 is a flowchart for illustrating detailed processing of variable bitrate control of video in the processing 
of step S400 of Fig. 108. 

[0398] At step S500, an initial value SV1 is set in the allowance value sv_now of VBR. The variable bitrate control 
10 in this case is made so that the allowance value sv_now will not be negative (-). 

[0399] At step S501 , the allocated bit b_alloc of encoding of the current GOP is found. 
[0400] At step S502, it is checked whether the following inequality: 
sv__now +b_av-b_alloc > 0. 

holds. This step S502 is a check step for verifying whether or not the allowance value of VBV is not minus. 
15 [0401] In the above inequality, b_av is an average value of the amount of encoded bits of encoding per GOP. If the 
time duration of a GOP in question is 0.5 sec, b_av has the following value: 
b_av = (average bitrate of video)*0.5 

[0402] If the result of check at step S502 is YES, processing transfers to step S504.. If the result of check at step 
S502 is NO, processing transfers to step S504 to set b_alloc to b_av. Then, processing transfers to step S504. 
20 [0403] At step s504, the current GOP is encoded. The current GOP is encoded with the amount of allocated bits b- 
alloc. The VBV control in this case is such that, if the bit occupying value in the VBV buffer is not full or full, the input 
bitrate to the buffer will be the maximum bitrate of VBR (variable bitrate) or zero, respectively (see Fig.101). At this 
step, the stuffing bytes are not encoded in the video stream. 

[0404] At step S505, the allowance value of VBR sv_now is updated in accordance with the following formula: 
25 sv_now += b_av - b_gen 

where b_gen is an amount of encoding bits of the current GOP obtained as a result of encoding the current GOP at 
step S504. 

[0405] At step S506, it is checked whether or not the current GOP is the last GOP. If the result of check at step S506 
is YES, processing comes to a close. If the result of check at step S506 is NO, processing reverts to step S501 . 

30 [0406] The recording method of Figs. 108, 1 09 fails to guarantee the proportional relation between time lapse of the 
AV stream and the amount of data bytes of the AV stream within a preset error range. For example, if there is a still 
picture continuing for long time in the input video, the relation between time lapse of the AV stream and the amount of 
data bytes of the AV stream is as shown in Fig. 104. That is, since the still picture is of a smaller information volume, 
the amount of bits generated on actual encoding is saturated at a smaller value, even if the amount of allocated bits 

35 for encoding is larger than the information volume. So, the relation between time lapse of the AV stream and the amount 
of data bytes of the AV stream in ths case is not linear. 

[0407] On the other hand, if, with a view to managing control so that the video encoder will use the amount of bits 
allocated of preset time, control is exercised so that the input bitrate to the VBV buffer is the encoding bitrate for the 
current GOR and so that the stuffing bytes will be generated by the video encoder so as not to produce VBV buffer 
40 overflow, it can be guaranteed that the relation between time lapse of the AV stream and the amount of data bytes of 
the AV stream is substantially linear within a preset error range. 

[0408] As a method of realizing the encoding mode guaranteeing the proportional relation between time lapse of the 
AV stream and the amount of data bytes of the AV stream, it may be contemplated to insert a null-packet in multiplexing 
a transport packet to record a transport stream of a constant bitrate. This is an encoding method mainly used in recording 
45 a transport packet, such as D-VHS. Meanwhile, a null-packet means a transport packet the packet ID (PID) of which 
is set to 0x1 FFF and which has no meaning as information. 

[0409] For reference in comparing the present method to the method of Fig. 99, Fig.110 shows a flowchart of the 
encoding mode in which, by encoding the transport stream of a preset bitrate, the proportional relation between time 
lapse of the AV stream and the amount of data bytes of the AV stream is guaranteed. 

50 [0410] At step S600, the multiplexing bitrate of the transport stream and the bitrate for video encoding are set. At 
step S601 , the video stream is encoded at a preset constant bitrate or at a bitrate lower than the present bitrate. 
[0411] At step S602, a null-packet (a transport packet having no meaning as information) is generated and multi- 
plexed, in case there lacks an elementary stream to be rendered into a transport packet, to encode a transport stream 
of a preset constant multiplexing bitrate. 

55 [0412] At step S603, an arrival time stamp is appended to each transport packet to form a source packet, which 
source packet is then recorded on a recording medium. 

[0413] If the AV stream is recorded as Clip by the above-described recording method, time_controlled_flag of the 
Clip is set to 1 . However, this method, employing a null-packet and which thus fails to use the encoding bits for video 
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encoding efficiently, suffers from the problem of the picture quality being inferior to that achieved with the encoding 
method of Fig. 99 (this has been discussed in detail in e.g., the description of the prior art of Japanese Laying-Open 
Patent Publication H-11 -220727). So, the recording method of Fig. 111 is not recommended in the present invention. 
[0414] The method of partially deleting only a certain time portion of the AV stream file is hereinafter explained. 

5 [0415] Fig. 111 shows examples of an original AV stream file and an AV stream file following editing for erasing a 
partial reproducing portion of the AV stream file. It is assumed that, prior to the editing, Virtual PlayList points to IN_time 
and OUTJime on the original AV stream file. If an editing of erasing the stream portion not used by the Virtual PlayList 
(minimizing editing) is performed, it changes the original AV stream file into an as-edited stream file shown in Fig. 111 . 
The data as from the leading end up to point X of the original AV stream file and the data as from a point Y to the trailing 

10 end are erased. In the following, a typical method of determining these points X and Y is explained. 

[041 6] Fig. 1 1 2 illustrates the method of erasing unneeded data ahead of the point IN without analyzing the AV stream 
contents. PlayList denotes the point IN on the original AV stream file, while showing EP_map of the AV stream. For 
decoding a picture denoted by the point IN, an l-picture beginning from an address ISA2 is required. 
[041 7] At back of the point X, PAT, PMT and PCR packets are required. The PTS of RSPN_EP_start = ISA1 is pts1 , 

15 while the PTS of RSPr\LEP_start = ISA2 is pts2. If the time difference of the system time base of pts1 and that of pts2 
is not less than 1 00 msec, there exist PAT, PMT and PCR packets between the address ISA1 and ISA2 (this holds at 
least for SESF, DVS„ ATSC and ISDB). 

[0418] Therefore, the point X is determined ahead of the address ISA1 . The point X must be on a boundary of an 
aligned unit. The recorder is also able to determine the point X, using EP_map, without analyzing the AV stream 
20 contents, by the following steps: 

(51 ) the step of finding SPN_EP_start having a value of PTS of the past display time closest to PTS of IN time on 
the system timebase; and 

(52) the step of finding SPN_EP_start having a value of PTS of the display time more past at least 1 00 msec than 
25 the value of the PTS of SPN_EP_start as found at step S1 ; 

(53) the point X being determined before SPN_EP_start as found at step S2 : with the point X necessarily lying on 
the boundary of the aligned unit. 

[0419] This method is simple because it is unnecessary to read out data of the AV stream to analyze its contents. 
30 However, there are occasions where unneeded data is left in reproducing the PlayList of the as-edited AV stream. If 
the AV stream data are read out in order to determine the point X and the data contents are analyzed, data unneeded 
for reproducing the PlayList can be erased efficiently. 

[0420] Fig.113 illustrates the method of erasing unneeded data at back of the OUT point without analyzing the AV 
stream contents. PlayList points to the OUT point on the original AV stream, while showing EP_map of the AV stream. 
35 [0421] It is presupposed that the video sequence beginning from SPN_EP_start = ISA4 is 
12 B0 B1 P5 

where I, P and B denote I-, P- and B-pictures, respectively and the suffix numbers denote the display order. If, in this 
processing, the recorder fails to analyze the AV stream contents, it is not aware of the information, such as picture 
coding type or temporal reference, of the pictures referenced by the PTS of OUTJime. The PTS of OUTJime may be 
40 referencing pictures B0 or B1(this cannot be known unless the recorder analyzes the AV stream contents). In such 
case, 12 is necessary in order to decode pictures B0 and B1 . The PTS of 12 is larger than PTS of OUT time. The PTS 
of 12 is larger than PTS of OUT time (OUTJime < pts4 5 where pts4 is PTS of 12). Although PTS of 12 is larger than 
PTS of OUTJime, 12 is required for B0 and B1 . 

[0422] So, the point Y is determined to be at back of an address ISA5 shown. ISA5 is the value of SPN_EP_start 
45 directly at back of ISA4 in EP_map. The point Y also must lie on the boundary of the aligned unit. 

[0423] The recorder may determine the point Y at the next step, using EPjnap, without analyzing the AV stream 
contents, by the following steps:. 

(51) the step of finding SPN_EP_start having a value of PTS of the display time closest to and more future than 
50 PTS pf OUT time on the system time base; and 

(52) the step of finding SPN_EP_start lying directly at back of SPN_EP_start as found on the system time base; 

(53) the point Y being set so as to be more rearwards than SPN_EP_start as found at step S2. The point Y must 
be on the boundary of the aligned unit. 

55 [0424] This method is simple because it is unnecessary to read out AV stream data to analyze its contents to deter- 
mine the point Y. However, there are occasions where unneeded data is left in reproducing the PlayList of the as-edited 
AV stream. If the AV stream data are read out in order to determine the point Y, and the data contents are analyzed, 
data unneeded for reproducing the PlayList can be erased more efficiently. 
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[0425] Referring to the flowchart of Fig. 11 4, a typical operation of the PlayList is explained. This processing is per- 
formed on the multiplexing stream analysis unit 18 of the recording and/or reproducing apparatus shown in Fig.1 . 
[0426] At step S11, the stream analysis unit 18 sets the PID of video of the AV program recorded. If plural videos 
are included in the transport stream, respective video PIDs are set. 

5 [0427] At step S1 2, the stream analysis unit 1 8 receives a video transport packet. 

[0428] At step S13, the stream analysis unit analyzes whether or not the payload of the transport packet (data part 
next following a packet header) begins with the first byte of the PES packet (PES packet is a packet prescribed in 
MPEG2 and packetize an elementary stream). This can be known by checking the value of 
"payload^unit^startjndicator" in the transport packet header. If this value is 1 , the payload of the transport packet 

10 begins with the first byte of the PES packet. If the result of check at step S13 is NO, processing reverts to step S12. 
If the result of check at step S13 is YES, processing transfers to step S14. 

[0429] At step S1 4, the stream analysis unit checks whether or not the payload of the PES packet begins at the first 
byte of sequence_header_code of MPEG video ("0x000001 B3 in 32 bits"). If the result of check at step S14 is NO, 
processing reverts to step S12. If the result of check at step S14 is YES, processing transfers to step S15. 
15 [0430] At step S15, the current transport packet is an entry point. At step S16, the stream analysis unit acquires the 
PID of the video to which belong the PTS of the l-picture beginning from the sequence_header_code and its entry 
point as well as the packet number of the above packet to input the video PID to the controller 23. The controller 23 
forms EP_map. 

[0431] At step S17, it is checked whether or not the current packet is the last input transport packet. If the current 
20 packet is not the last packet, processing reverts to step S12. If the current packet is the last packet, processing is 
terminated. 

[0432] The above-described sequence of operations may be executed not only on a hardware but also on a software. 
If the sequence of operations is to be carried out on the software, the program forming the software or a variety of 
programs are installed on a dedicated hardware of a computer, such that the programs are installed form a recording 

25 medium, such as a general -purpose personal computer. 

[0433] Referring to Fig. 11 5, this recording medium may be constituted by a package medium distributed for furnishing 
the user with a program, in addition to a computer. The package medium may be exemplified by a magnetic disc 221 
(inclusive of floppy disc), an optical disc 222 (inclusive of CD-ROM (Compact Disc- Read Only memory) and DVD 
(Digital versatile Disc)), a magneto-optical disc 223 (inclusive of MD (Mini-Disc)) or a semiconductor memory 224. In 

30 addition, the recording medium may be constituted by a hard disc furnished to the user as it is pre-loaded on a computer 
and which includes a ROM 202 or a memory 208 having the program stored therein. 

[0434] In the present specification, the respective steps stating the sequence of the program furnished by a medium 
includes not only the processing executed chronologically in accordance with the stated order but also the processing 
executed in parallel or batch-wise. 
35 [0435] The system in the present specification denotes an entire apparatus made up of plural devices. 

Industrial Applicability 

[0436] In encoding and recording the AV stream, time_controlled_flag and TS_average_rate are recorded as the 
40 attribute information for the AV stream. If time_controlled_flag is set to 1 , it is guaranteed that the relation between 

time lapse of the AV stream and the amount of data bytes of the AV stream is linear within a preset error range. 

TS_average_rate expresses the average bitrate of the AV stream file (transport stream) in terms of bytes/second as 

a unit, and is determined at a preset value by the application of the recorder. For example, TS_average_rate values 

are set for respective modes, that is long playing (LP) mode, standard playing (SP) mode, or high quality (HQ) mode. 
45 [0437] If time_controlled_flag of the AV stream file is set to 1 , and the stream is erased partially for a preset time 

stream portion, the unoccupied area corresponding to the time portion of the AV stream file of the SP mode, for example, 

can be created on the disc. 

[0438] If time_controlled_f!ag is set to 1 , the AV stream is encoded as follows: 

so (1) A multiplexing bitrate of a transport stream and an average bitrate for video encoding are set. 

(2) A video stream is encoded at a variable bitrate such as to guarantee a preset average bitrate every preset time 
domain. It should be noted that the VBV (video buffering verifier) of MPEG video encoding is controlled in such a 
manner that, for managing control so that the video encoder will use a bit amount allocated to the video of the 
preset time, the input bitrate to the VBV buffer will be the current encoding bitrate, and so that the video encoder 

55 will generate stuffing bytes to inhibit overflow of the VBV buffer. 

(3) The multiplexing is controlled so that, in the absence of an elementary stream to be rendered into a transport 
packet, no null-packet will be produced. 

(4) Arrival time stamp is appended to each transport packet to render it into a source packet and the source packet 
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string is front-padded for recording as an AV stream file. 

*-. ■ ♦u^rv^nnor it ran be Guaranteed that, if a certain time portion 

at a bitrate shown by TS_average_rate can be created on the disc. 
Claims 

! . An encoding apparatus for encoding picture data, comprising: 
to the time lapse. 

or in a routine encoding mode. 

. - l. or^ntmii^r Generates additional information indicating 

6 - izzzxszz « " ts is:: ^™ c — - - pte,ure <°* ns da,a ls 

substantially proportionate to the lapse of time . 
7. An encoding method for encoding picture data, comprising: 

tionate to the time lapse. 

. A recording reedmnr M recorded .hereon a conx, U ,e,re.da. l e program ^ <» -Mho an — , ap- 
' p.ratus encoding pMura dala; aald program composing: 

rss s s satrrr- isrs set— - - - — «■* — 

tionate to the time lapse. 

9 . A program ,or eaaernp a oonrpura, oon-o»,n g an anoodlnp app.ra.oa encode pfcarre dara . errecure 

ria s s :=Ksr__s: __ses set-, - - - — — 

so tionate to the time elapsed. 

™ flag indicating the recording mode of said AV stream fie. 
" n The recording medium according to Cairn 1 0 wherein said Hag is time_control.ed_f.ag. 
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recording is made such that the file size will be proportionate to the time elapsed as from the time of recording. 
Amended claims under Art. 19.1 PCT 

1 . (Amended) An encoding apparatus for encoding picture data, comprising: 



an encoder for encoding said picture data at a variable rate; and 

a controller having an encoding mode in which the volume of encoded data is substantially proportionate to 
10 the elapsed time and another encoding mode in which it is not guaranteed that the volume of encoded picture 

data is proportionate to the elapsed time, said controller controlling the volume of encoded picture data de- 
pending on the pertinent encoding mode. 

2. (Amended) The encoding apparatus according to claim 1 wherein said controller manages control so that stuffing 
15 bytes will be encoded if the amount of said picture coding data generated per unit time is less than a preset value. 

3. The encoding apparatus according to claim 2 wherein said controller verifies whether or not the stuffing bytes will 
be encoded depending on the amount of data generated in encoding respective pictures. 

20 4. The encoding apparatus according to claim 2 wherein said controller manages control of encoding the stuffing 
bytes so that no overflow will be produced in the VBV buffer. 

5. (Amended) The encoding apparatus according to claim 1 wherein said controller manages control to perform en- 
coding in an encoding mode in which the amount of said picture coding data generated is substantially proportionate 

25 to the lapse of time within a preset error range. 

6. The encoding apparatus according to claim 5 wherein said controller generates the additional information indicating 
whether or not the encoding mode is such encoding mode in which the amount of said picture coding data is 
substantially proportionate to the lapse of time . 



7. (Amended) An encoding method for encoding picture data, comprising: 



an encoding step of encoding said picture data at a variable rate; and 

a controlling step having an encoding mode in which the volume of encoded data is substantially proportionate 
35 to the elapsed time and another encoding mode in which it is not guaranteed that the volume of encoded 

picture data is proportionate to the elapsed time, said controlling step controlling the volume of encoded picture 
data depending on the pertinent encoding mode. 

8. (Amended) A recording medium having recorded thereon a computer-readable program used for controlling an 
40 encoding apparatus encoding picture data; said program comprising: 

an encoding step of encoding said picture data at a variable rate; and 

a controlling step having an encoding mode in which the volume of encoded data is substantially proportionate 
to the elapsed time and another encoding mode in which it is not guaranteed that the volume of encoded 
45 picture data is proportionate to the elapsed time, said controlling step controlling the volume of encoded picture 

data depending on the pertinent encoding mode. 

9. (Amended) A program for causing a computer controlling an encoding apparatus encoding picture data to execute 

an encoding step of encoding said picture data at a variable rate; and 
50 a controlling step having an encoding mode in which the volume of encoded data is substantially proportionate 

to the elapsed time and another encoding mode in which it is not guaranteed that the volume of encoded picture 
data is proportionate to the elapsed time, said controlling step controlling the volume of encoded picture data 
depending on the pertinent encoding mode. 

55 10. (Amended) A recording medium having picture data recorded thereon, said recording medium having recorded 
thereon an AV stream file including said picture data and audio data associated with said picture data; and 

a flag indicating whether or not encoding has been made in an encoding mode in which the file size of said 
AV stream is substantially proportionate to the lapse of time. 
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11. (Amended) The recording medium according to claim 10 wherein said flag is time_controlled_flag. 

12. (Deleted) 
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